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Boris Springborn Geometry I Lecture 1 Winter Semester 07/08 


Introduction 


It seems natural that a course entitled Geometry should begin with the question: 
What is geometry? 


Right now, I would like to answer this question in the form of a short historic overview of the 
subject. Geometry is, after all, something that people have been doing for a very long time. 
The following brief history of geometry will be incomplete, inaccurate (true history is much more 
complicated) and biased (we will ignore what happened in India or China, for example). It is a 
shortened, smoothed out version of history that is meant only as a rough explanation of how the 
material that will be covered in this course came into being. 


The word geometry comes from the Greek word yewyeteta, which is a composite of the words 
for earth and measure. Geometry began as the science of measuring the earth, or surveying, and 
it began ~2000 BC in Egypt and Mesopotamia (Babylon, in today’s Iraq). These were among 
the first great civilizations and they depended on agriculture along the rivers Nile in Egypt, and 
Tigris and Euphrates in Mesopotamia. These rivers would periodically inundate and fertilize the 
surrounding land, which made periodic surveying necessary to delimit the fields. The science 
of geometry developed from this, with applications also in construction and astronomy. The 
Egyptians and Babylonians could compute areas and volumes of simple geometric figures, they 
had some approximations for 7, and they already knew Pythagoras’ theorem. Strangely though, 
no records of general theorems or proofs have survived from this period. Egyptian papyri and 
Babylonian clay tables with their cuneiform script contain only worked exercises. Maybe they 
did not state general theorems, maybe they just did not write them down, or maybe they did but 
these documents did not survive. Basically we have no idea how they conducted their research. 


This changed with the period of Greek geometry (Thales ~600 BC to Euclid ~300 BC). They 
clearly stated general theorems for which they gave proofs. That is, they deduced more compli- 
cated statements from simpler ones by logical reasoning. This suggests putting all statements in 
order so that each statement is proved using only statements that have previously been proved. 
By necessity, one must begin with a few (as few as possible) hopefully very simple statements 
that are accepted without proof. In Euclid’s Elements, geometry (most or even all of what was 
known at the time) is presented in this form. It begins with a few definitions and postulates 
(today we say axioms) from which all theorems are deduced one by one. These postulates were 
simple statements like “there is a unique straight line through two points” and “two lines in- 
tersect in a unique point or they are parallel”. But one of the postulates was considered more 
complicated and less obvious than the others, the parallel postulate: “Given a line and a point 
not on the line, there is a unique parallel to the line through the point.” For centuries to come, 
people tried to prove this one postulate using the other, simpler ones, so that it could be elim- 
inated from the unproved postulates. One way people tried to prove the parallel postulate was 
to assume instead that there are many parallels and derive a contradiction. But even though 
some strange theorems could be deduced from this alternative parallel postulate (like that there 
is an upper bound for the area of triangles) no true contradiction would appear. This finally lead 
to the realization that the alternative parallel postulate did not contradict the other postulates. 
Instead it leads to a logically equally valid version of geometry which is now called hyperbolic 
geometry (Lobachevsky 1829, Bélyai 1831). Later it was realized that one may also assume that 
there are no parallels (the other postulates also have to be changed a little for this), and the 
resulting geometry is called elliptic geometry. This is simply the geometry on the sphere, where 
pairs of opposite points are considered as one point, and lines are great circles. Both hyperbolic 
and elliptic geometry are called non-Euclidean geometries, because their axioms are different 
from Euclid’s. 


Another important development in geometry was the introduction of coordinates by Descartes 
and Fermat in the first half of the 17th century. One could then describe geometric figures and 
prove theorems using numbers. This way of doing geometry was called analytic geometry, as 
opposed to the old way beginning with geometric axioms, which was called synthetic geometry. 
By the late 19th/early 20th century, it was proved that both approaches are in fact equivalent: 
One can either start with axioms for numbers and use them to define the objects of geometry, 
or one can start with axioms of geometry and define numbers geometrically, one gets the same 
theorems. 
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The study of the rules of perspective in painting (da Vinci & Diirer ~1500) lead to the devel- 
opment of projective geometry (Poncelet, 1822), dealing with the question: Which properties of 
geometric figures do not change under projections? For example, straight lines remain straight 
lines, but parallels do not remain parallel. 


Another type of geometry is Mobius geometry, which deals with properties that remain unchanged 
under transformations mapping circles to circles (such as inversion on a circle). Then there is 
also Lie geometry (about which I will say nothing now) and there are other types of geometry. 
Klein’s Erlangen Program (1871) provided a systematic treatment of all these different kinds of 
geometry and their interrelationships. It also provided a comprehensive and maybe surprising 
answer to the question: What is geometry? We will come back to this. 


Contents of this course 


spherical geometry 
hyperbolic geometry 
projective geometry 
Mobius geometry 
Lie geometry 

some other stuff 


Spherical geometry 


The n-dimensional unit sphere is S” = {2 € R"*? | ||ax|| = 1} C R®*1, where ||z|| = \/(2, z) 
and (x,y) = >) a;y; is the standard Euclidean scalar product. We will consider mainly S? C R°. 
A great circle in S$ is the intersection of S? with a plane through the origin, E = {x € R"*? | 
(x,n) = O}, ||n|| = 1. (The intersection with a plane not through 0 is called a small circle.) The 
points +n € S? are called the poles of the great circle. For x,y € $7, « 4 +y, there is a unique 
great circle through x and y. (If « = +y, there is a one parameter family.) Two great circles 
intersect in two diametrically opposite points. 


Theorem. The shortest continuously differentiable curve connecting two points x,y € S? is the 
shorter arc of the great circle through x and y. Its length is d(x, y) := arccos(, y). 


Proof. The second sentence of the theorem is clear. To prove the first sentence, let + : [to, ti] > 
S? be a continuously differentiable curve from x to y. We have to show length(y) > d(z,y), 
with equality only if + is an arc of a great circle. We may assume that +(t) = « only for t = to 
and y(t) = y only for t = t,. Let f(p) = arccos(x,p), which is defined where |(x,p)| < 1 


and differentiable where strict inequality holds. For p € S? \ {a,—zx} let u(p) = grad f(p) — 
(grad f(p),p) p. A direct calculation shows that ||v|| = 1. Then 


ty 


a(e,y) = f@)-F(e) = i. “(grad f((t)),7(t)) dt = i (o(y(t),7(t)) dt < / “ [ly (@)|| at = lengt(y), 


to to to 


where we have used (y’,7) = 0 and the Cauchy-Schwarz inequality. Equality only holds if y’ 
is always in the direction of v(y). This can be used to show that y must be an arc of a great 
circle. 


In fact, this proof works also if 7 is only assumed piecewise C!. 


Corollaries. (i) The shortest curves connecting two opposite points are halves of great circles. 
(ti) The function d(x, y) is a metric on the sphere. 
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Hemispheres and digons 


A hemisphere is the intersection of $? with a half-space 


H={xeER*|(a,n) 20}, [nl] =1. 


A hemisphere is bounded by a great circle. One of its poles, n, lies inside the hemisphere, the 
other, —n, outside. 


A spherical digon is the intersection of two hemispheres. 


The interior angle a and the exterior angle & of the digon satisfy a+ @ = m and cos@ = (m,n), 
so cosa = —(m,n). The exterior angle is the spherical distance between the poles m and n. 


The area of the digon is 2a. 


(One could alternatively define a spherical digon as a spherical region bounded by two half great 
circles. Then the interior angle a could exceed 7. The formula for the area would still hold.) 


Spherical triangles 


There are actually several different sensible definitions for spherical triangles: 


1. 


The intersection of three hemispheres with poles not on one great circle. Such triangles are 
called Euler triangles. 

A spherical region bounded by three great circular arcs. Such triangles are either an Euler 
triangle, the outside of an Euler triangle, the union of an Euler triangle and a hemisphere, or 
the difference of a hemisphere and an Euler triangle. 

An oriented closed curve on the sphere consisting of three great circular arcs. (The arcs may 
intersect each other.) Such spherical triangles were considered by Mébius. To measure the 
angles of such a triangle sensibly, the sphere must be given an orientation. 

Finally, a spherical triangle in the sense of Study consists of (i) three points A,B,C € S?, 
(ii) the three great circles gag, gsc, gca through A and B, B and C, C and A, each of which 
is given an arbitrary orientation, and (iii) numbers a,b,c,@,8,4 € R such that: (a) If one 
moves from A along gag the signed distance c, then one reaches B. Similarly for the other 
points and great circles. (b) If one rotates gap around A by the signed angle 4, it is moved 
into gca with correct orientation. Similarly for the other points and great circles. Triangles 
are considered different even if the corresponding numbers differ only by integer multiples 
of 7. 
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We will now consider Euler triangles, and “spherical triangle” will mean “Euler triangle”, unless 
stated otherwise. 


In the definition above, an Euler triangle is defined in terms of hemispheres. But an Euler 
triangle is also determined by its vertices: 


Suppose A, B,C € S? do not all lie on one great circle. (That is, A, B,C are linearly independent 
unit vectors in R*). Then the intersection of S$? with the cone 


C={\A+yB+0C € R?| A, p,v > 0} 

is an Euler triangle. Indeed, there are unique unit vectors A’, B’,C’ € S$? with 

(A’, A) > 0, (A’, B) =0, (A’,C) =0, 

(Bi Ay =O: (B’, B) > 0, (BY,C) =0, 

(C’, A) =0, (C, By = 0; (C’,C) >0, 
and C is the intersection of the half-spaces Hy’, Hp, Ho’, where Hx = {x € R® | (X, x) > O}. 
The side lengths a,b,c and exterior angles a, 8, 4 (see figure above) are determined by 

cosa = (B,C), cos @ = (B’,C"), etc. 
The interior angles are a = 1 — G, etc. 


The triangle with vertices A’, B’,C’ is called the polar triangle of the triangle with vertices 
A,B,C. 


The polar triangle of the polar triangle is the original triangle. 


The side lengths of the polar triangle are the exterior angles of the original triangle and vice 
versa. 


The side lengths a, b,c of a spherical triangle satisfy the inequalities 
-—a+b+c>Q0, a—b+c>0, a+b—c>0, a+b+c< 2n. 


The first three inequalities follow directly from the triangle inequality of the spherical metric d. 
The third follows from 
d(A, B) + d(B,C) + d(C, A) < d(A, B) + d(B, —A) + d(—A, C) + d(C, A) = 2a. 

Applying the same reasoning to the polar triangle, we get for the exterior angles of a spherical 
triangle 

—A+8+4>0, 4&-B+49>0, a@+8-4F>0, &4+64+4<2n, 
and hence for the interior angles 

-a+6+7<17, a-B+y<n, a+B-y<17, at+B+y>tT. 
Theorem. The area of a spherical triangle with interior angles a, 0, y isa+B+y-T. 


Proof. The three great circles through A and B, B and C, C and A divide the sphere into eight 
triangles. Two of them are the triangle A, B,C and the triangle —A,—B,—C. They have the 
same area because they are symmetric with respect to the origin. The other six triangles all have 
a side in common with either triangle A, B,C or triangle —A,—B,—C and complement these 
to six digons, two with angle a, two with angle 3, two with angle 7. Altogether, the six digons 
cover each of the triangles A, B,C and —A,—B,—C three times, and the rest of the sphere once. 
So 


4(a@+ 8+) =sum of area of the six digons 
= area(S”) + 2area(AA, B,C) + 2area(A — A,—B,—C) 
= 4n + 4area(AA, B,C). 
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Theorem. Let a,b,c € R. Then 
(i) a,b,c satisfy the inequalities 


—at+b+c>Q0, a—b+c>0, a+b-c>0O, a+b+c< 27 (*) 


if and only if there is a spherical triangle with side lengths a, b,c. 
(ii) If it exists, this triangle is unique up to an orthogonal transformation of R°. 


Proof. (i) The “if” part was shown in the last lecture. To see converse, assume a, b, c satisfy (*). 
Note first that this implies 0 < a,b,c < a. Now ponder the following picture: 


(ti) Suppose A, B,C and A,B,C are the vertices of two triangles with corresponding sides of 
equal length. Because corresponding scalar products are equal ((A, B) = (A, B), etc.), the linear 
map T : R? > R? with T(A) = A, T(B) = B, T(B) = B is orthogonal. 


Remarks. (i) There is of course an analogous theorem regarding the angles of a spherical triangle. 


(tt) The region D := {(2) € R3 | a, b,c satisfy (+) is the interior of the tetrahedron with 

vertices (3), (7), (5), (=). The region of possible vectors of exterior angles (3) is also D. 
wT T 7 

By the previous theorem (and its analogue for angles), the side lengths determine the angles and 

vice versa. This gives a bijection D > D. If the side lengths approach a face (vertex) of D, then 

the corresponding angles approach a vertex (face) of D. 


Theorem. Side lengths and exterior angles of a spherical triangle satisfy the equations 


. —cosa+cosbcose : : 
cos @ = —— ‘ (side cosine theorem) 
sin bsinc 


— cos @ + cos 8 cos ¥ 


cos a4 = (angle cosine theorem) 


sin 8 sin ¥ 
and four more equations obtained by simultaneous permutations of a,b,c and a, B,4. 


The following proof using Gram matrices exemplifies a general method which will be useful 
again later. The Gram matrix for a (finite) sequence of vectors v1,...v, is the symmetric matrix 
((v;,0;))fj<1 of pairwise scalar products. 


Proof. Let V = (A B C) € R°*® be the matrix whose columns are the vertices of the spherical 
triangle, considered as column vectors. Then the Gram matrix for A, B,C is 


(A, A) (A,B) (A,C) 1 cose cosb 
G=V'V =[(B,A) (B,B) (B,C))=[cose 1 cosa 
(C,A) (C,B) (C,C) cosb cosa 1 
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(Note for later that det G = (det V)? > 0.) Similarly, let W = (A’ B’ C’) be the matrix of poles. 
Their Gram matrix is 


(A’, A’) (A, BY) (A,C") 1 cos} cosf 
G’=W'tw =| (BA) (BB'S (B',C’)| =| cos7 1. cose 
(C’, A’) (C',B') (C',C’) cos6B cosa 1 
Also, 
(Al A) 4A’ BY (AIC) (A’, A) 0 0 
Wtv = | (B’,A) (B’,B) (B,C)|= 0 (B’, B) 0 =:D 
(OAs OB: ACT se) 0 0 (C",C) 
is a diagonal matrix with positive entries. SoW*t = DV~! and W = (V‘)~1D, and 
Ge DVO Be DIV VD = DE" "Dp. (**) 
The inverse of G is 
1 sin? a —cosc+cosacosb —cosb+cosccosa 
Retry, —cosc+cosacosb sin? b — cosa + cos bcose 
‘ —cosb+cosccosa —cosa+cosbcose sin? c 


1 
det G 


Substitute this into (**) and consider diagonal elements: One finds 1 = D?, sin? a, therefore 


Dy = YS, and similarly Do. = Y$42, D33 = YAS. Now consider for example element 


sin 
(3,2) in (**): cos & = D33 5-45 (— cosa + cos bcos c)D22. This is the side cosine theorem. 


The angle cosine theorem is the side cosine theorem applied to the polar triangle. 


Theorem. Side lengths and interior angles of a spherical triangle satisfy 


sina sinb sinc 


(sine theorem) 


. . —_ . >) 
sina sinf siny 


(half-angle theorem) 


a i sin(2=+2) sin( 2+8=<) 


sin(=*42+*) sin( Stee) ” 


cos( =ot bry) cos( otb+7) 


2-849) eos( 9481) 


(half-side theorem) 


of 
2 
5 
| 


2 cos( cos( 


cos a—cos bcosc 


Proof. In terms of the interior angle a, the side cosine theorem says cosa = sinbsine 


Using cosa = 2cos? s-ASel- 2sin? 2 and other trigonometric identities, one obtains the 


2 
equations 


ae sin(=*32*) sin( atbhtc) 
2 sin b sinc : 


ing = Em 
2 


sin b sinc ; 

which are also of independent interest. Dividing one by the other, one obtains the half-angle 
theorem. The half-side theorem is the half-angle theorem for the polar triangle. To prove the 
sine theorem, consider 


; . a a P 
sina = 2sin —cos — = — —, 
2 2 sin b sine 
where 


P= 2yfsin (EAP FS) sin (PE FE) sin (AE 8) gin (AHP S) | 


So sinasinbsinc = P. But since the expression for P is symmetric in a, b,c, one has equally 


sinasinbsinc = sin sincsina = sinysinasinb = P. 


Divide by sinasin bsinc to obtain the sine theorem. 
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In principle, all relations between side lengths and angles of a spherical triangle can be derived 
from the inequalities for the sides and the side cosine theorem (or alternatively from the inequali- 
ties for the angles and the angle cosine theorem) using only algebra and analysis, without further 
recourse to geometry. We have derived the half-angle theorem and the sine theorem in this way. 
On the other hand, the following proof of Napier’s rule, which is due to Napier himself, uses a 
remarkable geometric construction. 


Theorem (Napier’s rule). Consider a right-angled spherical triangle 


with y = F, and leta= 5 a,b=% b. Then 


cosc = sinb sina = cota cot 8, 
cos 8 = sina sinb = cotc cota, 
cos G = sinc sina = cot 6 cot b, 
cosb = sin 8 sinc = cotG cota, 
cosa = sina sin 8 = cot b cotc. 
That is: “The cosine of any part is equal to the product of sines of 


opposite parts and to the product of cotangents of adjacent parts.” (The 
“parts” are a, b, a, c, 8, in this cyclic order.) 


cos c—cos a cos b 


Proof. The first line of equations follows directly from the cosine rules cosy = “SS—=Ee 


and cosc = coe toe eb . To prove the remaining equations, assume first that a,b,c,a, 8 < 5. 


Consider the following construction. Draw the two great circles that have A and B as poles. 
Together with the extended sides of the original triangle, they form 4 other right angled triangles: 


(The five triangles form a right-angled pentagram called the pentagramma mirificum. In its 
center there is a spherical pentagon each vertex of which is the pole of the opposite side.) Note 
that a, = 5-66, =5- 8,0, = 5 -a,c, = 5 —b, 6, =a, so that 

a, =, bi = B, a1 = a, co =), By =a. 


This proves the other equations of Napier’s rule under the assumption of acute angles and sides. 
Now suppose that a side length a,b,c or an angle a, 8 is greater than }. (The remaining cases 
where one is equal to 5 consist of doubly or triply right-angled triangles for which Napier’s rule 
can easily be checked.) It can be shown that, first, for one of the neighbor triangles (into which 
the sphere is separated by the same great circles) a,b,c,a,8 < 5, and, second, if Napier’s rule 


holds for one of the neighbor triangles, it holds for all of them. 
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Stereographic projection 


Project the unit sphere S? C R® from the “north pole” e3 = (8) to the plane x3 = 0. This map 


a : S? \ {e3} > R? is called stereographic projection. One easily derives the following equations 
for o and its inverse: 


Theorem. Stereographic projection o maps circles in S* which contain e3 to lines in R? and all 
other circles in S? to circles in R?. All circles and lines in R? are images of circles in S?. 


Remark. The fact that circles through e3 are mapped to lines is geometrically clear: A circle 
through e3 is the intersection of S$? with a plane through e3. So all projection rays lie in this 
plane, and the circle is mapped to the line in which it intersects the image plane x3 = 0. 


Proof. A circle in S? is the intersection of S? with a plane 
E = {x € R? | (a,n) = d}, where |n||/=1, O<d<l. 


It contains e3 iff d = (e3,n) =n3. A point u € R? in the image of the circle iff o~'(u) € E, that 
is, iff 
1 
d = ~~~ (2uy ny + Quyng + (u2 + u2 — 1)n3), 
easel (2uym1 + 2ugne + (uy + uz — 1)ns) 


or equivalently, 


0 = (nz — d)(uZ + ud) + Quin + 2ugng — (n3 +d). 


If ng = d, this is the equation for a line. Otherwise, divide by n3 — d, complete the squares, and 
use ||n||? = 1 to obtain 


22 
(m+ =o a) +(e = an a FP a 


“ihe 5 ‘ : - 4 22) =i ny : _ Vi-d? 
This is the equation for a circle with center ¢ = —— ( a) and radius r = eerie 


To show (without further calculations) that every circle and every line in R? is the image of a 
circle in S?, you can make an argument using the fact that three points in R? uniquely determine 
a line or circle through them and three points in S* uniquely determine a circle through them. 
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Theorem. Stereographic projection is conformal, that is, it preserves angles: If two curves in 
S? \ fe3} intersect at some angle, then their image curves in R? intersect at the same angle. 


Proof. Let 4, : (—e,¢) + S$? \ {e3} be two curves with 4(0) = (0) = p, and let y = 0 04, 
n = 00% be their images curves in R?. Let 6 = 4/(0), w = 4'(0), v = 7/(0), w = 7'(0), and let 
p = (0) = 7(0) = o(p). The intersection angles @ and a between 4, 7 and y, 7, are determined 
by 


eae Lk eee ME 2 
(6, 6) (Ww, w) a/(v, v) (w, w) 


We want to show that @ =a. In fact, we will show that 


(6,0) = wpe (%¥), (6,6) = Gegyere (YY), (0, B) = BEBE (ww), 
and this implies cos @ = cosa, and hence @ = a. 


First we derive equations for 6, w in terms of v, w. By the equation for a~! we have 


a 1 li 
1 Gay Gant 


so 


and hence 


ee ie poy (3p 
b6=V(0)= (pp) +1 ((3,) ~ (p,p)+1 (,,,)) : 


In the same way one gets 


~_ 2 wa) _ _(p,w) 2p. 
w= @p)+1 ((3,) (pp) +1 (,,)) , 


so 


(6,8) =p ota (eu) + (p,v)(p,w) — es (2(p, v) + (p, 0) ((p,p) - 1)) 
(p,v) ((p,p) +1) 
— eh; (2tp,w) + (p, w) ((p.p) — 1) + Bee (4(p,) + ((p.) — 1)? 
(p,p) +1 AMPs P, P,P q ((p,p) +1)? P,P P,P 


(p,w) ((p,p) +1) ((p,p) +1)? 


and similarly for (6,6) and (ww, w). 


All of this works in the same way also for the n-dimensional sphere S$” € R™*+. (But for n = 1, 
the projection from the circle to the line, the two theorems are trivial.) Stereographic projection 
from €n41 isa: S” \ {en4i} 3 R", 


1 ce 
a(t) = 7 7 ( ; ) 


1 — 
= mai ( : 
(u, u) + 1 (dei 2A 


It maps (n — 1)-dimensional spheres in S$” (intersections of S” with affine hyperplanes in R”*') 
to (n — 1)-dimensional spheres and planes in R”, and it is conformal. 


q 
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Bilinear and quadratic forms 


We will define n-dimensional hyperbolic space as 


H” = {ze R"™**| a7 bas+...+2% —27°.,=-1, tn41 > 0} 


={reR"! | (x,z) = —1, tn41 > OF, 


where now (a, y) = @1yit...-+2nYn—2n+1Yn41, and lengths of curves in H” and angles between 
them are measured using this scalar product instead of the normal Euclidean scalar product. We 
will need to be familiar with such indefinite scalar products, and since we will deal with general 
bilinear and quadratic forms later, it may be a good idea to refresh some material from linear 
algebra. 


Let V be an n-dimensional vector space over a field kK. (We will be interested in the cases 
K =Rand kK =C.) A bilinear form on V is a function b: V x V > K, which is linear in 


each argument. If e1,...,e, is a basis of V, then the matrix of the bilinear form bis Be K"*” 
with B;; = b(e;,e;). If z,y € K” are the coordinate vectors for v,w € V, that is v = 50; vie; 
and w = 3°, ye:, then b(v,w) = x By = aor Biyziy;. Tf fi,..., fn is another basis with 


fj =>0, Tijex, and p,q € K” are the coordinate vectors of v,w in this new basis, then « = Tp, 
y = Tq, and the matrix of b with respect to the new basis is B = T* BT. 


The bilinear form B is symmetric if b(v,w) = b(w,v) for all v,w € V. A bilinear form is 
symmetric if its matrix with respect to one (and hence every) basis is symmetric. 


A quadratic form on V is a function gq: V — K for which there exists a bilinear form 6 such that 
q(v) = b(v,v) forall ve V. (0) 


If q is a quadratic form, then there exists a unique symmetric bilinear form satisfying (0). Hence 
symmetric bilinear forms and quadratic forms are in one-to-one correspondence. Quadratic forms 
are homogeneous polynomials of degree 2 in (any) coordinates. 


Example. x? + 21x%2 + x3 is a quadratic form on K?. For example, it comes from the bilinear 
form 21 y1 +2 1y2 + X2y2, which is not symmetric. But it also comes from the symmetric bilinear 
form @1y1 + 5@1Y2 + 522y1 + Foy. 


Symmetric bilinear forms 
Suppose b is symmetric and q(v) = b(v,v). The kernel of 6 is 
kerb = {v € V | b(v, w) = 0 for all w € V3. 


This is a linear subspace of V. The bilinear and quadratic forms b, q are called degenerate 
if kerb A {0}, and non-degenerate if kerb = {0}. The form 6 is degenerate iff its matrix 
with respect to one (hence every) basis has determinant 0. Let Up = kerb and let U be any 
complementary subspace, so that V = U @ Up. Then the restrictions bly and gq|y are non- 
degenerate bilinear/quadratic forms on U. 


There exist bases €1,...,é, of V such that 
b(é:,€)) =O if j. (0) 


The basis vectors e; with b(é;, é;) = 0 form a basis for Up = ker b. Assume the basis is ordered so 
that these come last. Then the matrix of b with respect to this basis is diagonal with diagonal 
elements 1,...,A;,,0,...,0, where A; 4 0 and r = n— dimker b. In the coordinates uy1,..., Un 
with respect to this basis, gq is a sum of squares: 


du? + Agua Pte ie te dpu2. 


A basis €1,...,é€, satisfying (©) and the corresponding coordinates u1,..., Un, can be found using 
the generalized Gram-Schmidt orthogonalization procedure or by completing the squares. 
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Generalized Gram-Schmidt orthogonalization procedure 


The ordinary Gram-Schmidt orthogonalization procedure takes as input a basis e1,...,€, of V 
and a positive definite symmetric bilinear form b. It works like this: 
for i=1ton—1do 
for 7 =i+1tondo 


; —_ b(ei,e3) 
€j — €j — Bez ei) & 
end for 
end for 


This may not work if b is not positive definite because, for some 1, b(e;, e;) may be 0. If that is the 
case, do the following: If there is among the e;41,...,é@n, a basis vector, say ex, with b(e,, e,) #0, 
then swap e; and ex and continue. If that that is not possible because b(e;+1, €:41),---, b(€n, en) 
are all 0, then there must be some ex (i +1<k <n) with b(e;,e,) 40, because b was assumed 
to be non-degenerate. Assign e; + e, and e; — ex to e; and ex and continue. 


If b is degenerate, find a basis for ker b and a basis for a complementary subspace and apply the 
orthogonalization procedure to the latter. 


Warning: This algorithm is not numerically stable. Its main purpose is to prove the existence of 
a diagonalizing basis. 
Completing the squares 


Examples. Suppose in some coordinates q is 


2 


+ 010g +4123 4+ L2X3 


1 1 2 32 env 1 
Ly + at2 + 523) os qr2 qX3 + 5 l2X3 


— 1 2 3 1 2 20 2 
=(@1 + 9&2 + 9&3)" + 4(@2 + g¥3)" + 7p #3 
— . “~~ 
uy uz 08 
=<. 2 3 2 20 2 
= Uy, + A U2 + 2 U3. 


It may happen that there is no square to complete: 


=n) 2 1 2 a 1p 8 vd 1 
£1Lq + 2o%3 = 7(L1 + 22) 4 (%1 — 2)” + fo%3 = Gyr + Gyo + 5(Y1 — Y2)3, 
Y1 Yy2 


continue as before. 


Case kK =C 


We can always make all \1,..., A, equal 1 by dividing e, (k = 1,...,r) by /b(ex, ex) (arbitrary 
choice of square roots). Thus, any symmetric bilinear form on a complex vector space is in 
suitable coordinates 7), UKYk- 


Case K =R 


Let 


i, = (number of é, for which b(é,, é,) > 0), 
é 


i_ = (number of €; for which b(é,, €x) < 0), 


io = (number of é; for which b(é,, €,) = 0) = dim ker b. 


The numbers i,,%_,%9 do not depend on the particular basis, but only on b. (Why?) The 
numbers 74 and i_ are the positive and negative index of b. The signature of b is (i+,1_,%0), also 
written (i,,i_) if i9 = 0. We can normalize the é, (k = 1,...,r) by dividing by ./|b(éx, éx)| 
Thus, one obtains a basis with b(é;,é;) = +1 or 0, and any symmetric bilinear form on a real 
vector space is in suitable coordinates 


a4 t4+i_ 
Skye — S > cKyr- 
k=1 katz 
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Scalar products 


A scalar product is a non-degenerate symmetric bilinear form. Scalar products are often written 
(,-) or (-,-). Vectors v,w with (v, w) = 0 are called orthogonal to each other. 


Any n-dimensional complex vector space with a scalar product can by an appropriate choice of 
basis be identified with C” with the scalar product 4 LEYk- 


IR”? denotes the real vector space R?*?, equipped with the scalar product 


Pp pra 
(x,y) = > URUK — S- URUK» 
k=1 k=p+1 


which has signature (p,q). By an appropriate choice of basis, any real vector space with a scalar 
product with signature (p,q) can be identified with R?t?. A vector v in R?4 (or any other real 
vector space with scalar product) is called spacelike, timelike, or lightlike if (v,v) > 0, < 0, or = 0. 
The set of lightlike vectors is called the light cone. A Euclidean scalar product is a scalar product 


with negative index i_ = 0. A Lorentz scalar product is a scalar product with negative index 
i_ = 1. A vector space with a Euclidean or Lorentz scalar product is a Euclidean or Lorentz 
vector space, respectively. An orthonormal basis for R?? is a basis €1,...,én with (e;,e;) = 0 if 


iA#Aj, (ei,e;) =1 fori=1,...,p and (e;,e;) = —1 fori =p+1,...,n. 


Theorem. Let V be a real vector space with scalar product (-,-), let e1,...,@n be a basis, let 
Bi; = (ei, e;), and let 


Bi: Bir 


: ie dy, = det B. 


ago = 1, a, = By, a = det ( 3B)... on =a ( 
Bric: Ber 


Suppose that none of the ax are 0. Then the negative index of the scalar product (-,-) is equal to 
the number of sign changes in the sequence ao, @1,...,Qn- 


Orthogonal transformations 


Let V be a vector space with scalar product (-,-). An orthogonal transformation on V is a linear 
map T: V > V with (Tv, Tw) = (v,w) for all v,w € V. The orthogonal transformations form 
a group, the orthogonal group of V, (-,-). The orthogonal group of C” with standard scalar 
product is denoted by O(n,C). The orthogonal group of R?*? is denoted by O(p,q), or if g = 0 
also by O(p) or O(p,R). The corresponding matrix groups are also denoted by O(n, C), O(p,q). 
The columns of a matrix in O(p,q) form an orthonormal basis of R??. The determinant of an 
orthogonal transformation is +1. The orthogonal transformations with determinant +1 form 
subgroups, the special orthogonal groups, denoted by SO(n, C) and SO(p, q). 


Lorentz vector spaces 


R}! is R? with (x,y) = 21y1 — ay2. So (a,x) = x? — x 


and this is =0 if || 7 |v2|. The figure also shows another 
orthonormal basis. 


R?+ is R? with (x,y) = x,y, + Loy2 — £3y3. So (x, x) = 
x? + 23 — x2 and this is =0 if \/at + 23 : |r3|. 


In a Lorentz vector space, any non-zero vector which is 
orthogonal to a timelike vector is spacelike. 


An orthogonal transformation T € O(n,1) either maps 
each connected component of the hyperboloid (a, 7) = —1 
onto itself, or it interchanges the two components. Those 
that map each component onto itself form a subgroup, 
Ot(n,1). Also, SOt(n,1) = OF(n,1)N SO(n,1). In 
all, O(n, 1) has 4 connected components, each consisting 
of the transformations with determinant either +1 or —1 
and either fixing or interchanging the components of the 
hyperboloid (x, 2) = —1. 
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Hyperbolic geometry 


Hyperbolic space of n dimensions is 
H® = {¢ € R™ | (2,2) = —-1, 2n41 > 0}, 


where (x,y) = 7141 +... +2nYn — Ln41Yn+1 is the Lorentz scalar product of R™!. Lengths of 
curves and angles between them are measured using this scalar product (and not the Euclidean 
scalar product of R"t'): The length of a curve 7: [t1,t2] > H” is 


length(y) = f° (y(t), 7/(t)) at 


and the angle a between two curves 7,7 : (—e,¢) ~ H” intersecting in p = y(0) = 7(0) with 
non-zero velocities v = (0), w = 7'(0) is determined by 


cosa = a SAS . 
(u,v) (w.w) 


(vw) 


Why is this well defined? Why is (7’,7’) never negative and why is <1? For 


(v,v) (w.w) 
v ER”! let vt ={w € R™! | (v, w) = O}. If (v,v) < 0 (v is timelike) then the restriction of the 
scalar product (-,-) to vt is a Euclidean scalar product. Now (y,) = —1 implies (7,7) = 0, 
so y'(t) € 7(t)+. In the formulas for lengths and angles, the Lorentz scalar product is therefore 
applied to vectors in a subspace of R™+ on which it is Euclidean. 


One-dimensional hyperbolic space 


H' is one branch of the hyperbola 21? — x2? = —1 in R". 


agree! % 


This is a curve which can be parameterized as 7(s) = (Sibs). (Because cosh” s — sinh” s = 1.) 


Now 7/(s) = (coh), so (7/(s),7’(s)) = cosh” s — sinh? s = 1 and 


sinh s 
82 
ciel = if SAO) Sak 
81 


On the other hand, 
(y(s1), ¥(s2)) = sinh s; sinh sz — cosh sy cosh sz = — cosh(s, — s2). 


(Because cosh(x + y) = cosh x cosh y + sinha sinh y.) Hence, the hyperbolic distance d(pi, pz) of 
two points p1,p2 € H' is given by 


— cosh d(pi, p2) = (p1, p2). 


Hyperbolic lines 


A hyperbolic line in n-dimensional hyperbolic space H” is a non-empty intersection of H” with 
a 2-dimensional linear subspace U of R™?. 
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Proposition. If U is a 2-dimensional linear subspace of R™! with UNH” # 0, then the restric- 
tion (-,+)|u has signature (1,1). 


Proof. By assumption, there is a u € U with (u,u) = —1. Extend wu to a basis u,v of U, and let 
b=v+4 (u,v)u. Then u,v is a basis of U with (u,#) = 0. Because any non-zero vector which is 
orthogonal to a timelike vector is spacelike, (#,0) > 0. So (-,-)|y has signature (1,1). 


By an appropriate choice of basis, any 2-dimensional U intersecting H” can therefore be identified 
with R!1. The hyperbolic line UM H” is thus identified with 1-dimensional hyperbolic space 
H' e€R?. 


Remark. In the same way, any non-empty intersection of H” with a (k+1)-dimensional subspace 
can be identified with H*. 
For two points pi, p2 € H”, there is a unique hyperbolic line containing them: span(p1, p2)N H”. 


Theorem. The shortest piecewise continuously differentiable curve connecting two points p1,p2 © 
H” is the hyperbolic line segment between them. Its hyperbolic length is 


d(p1, p2) = arcosh(—(p1, p2)). 
This can be proved in the same way as we proved the corresponding theorem for the sphere. 
Two-dimensional hyperbolic space 


The hyperbolic plane H? is one component of the hyper- 
boloid of two sheets 21? + x2? — #3? = —1 in R®!. Any 
2-dimensional subspace U of R?:! which intersects H? is 


U = {x € R?" | (z,n) = 0} 


for some n € R* with (n,n) = 1. The vector —n would 
give the same subspace, but up to sign, the unit normal 
n of U is unique. Thus, the spacelike unit vectors in R?! 
are in 2-to-1 correspondence with the hyperbolic lines in 
H?. They are in 1-to-1 correspondence with the hyperbolic 
half-planes 


{x € R®" | (x,n) > 0} H?. 


Proposition. Let n,,n2 € R?+ with (ny,n1) = (n2,n2) = 1, and let l,,lz be the corresponding 
hyperbolic lines, l; = {a € H? | (x,n;) = 0}. Assume n, #4 ne, so that ly and lz are different 
lines. Then the following statements are equivalent. 


(i) The lines l, and lg intersect. 

(ii) The restriction of (-,-) to span(n1,n2) has signature (2,0). 

(iii) (m1, m2)| < 1. 
Proof. (i)=(ii): If l,l. 4 @ then there is an x € H? with (a,n1) = (a,n2) = 0. So at = 
span(n;,72), and (ii) follows because any non-zero vector orthogonal to a timelike vector is 
spacelike. 


(ii) (i): If the restriction of the scalar product to span(n1,n2) has signature (2,0), then its 
restriction to the orthogonal complement span(n,,n2)+ must have signature (0,1), so the com- 
plement intersects Ho. 


(ii)}(iii): The vectors n1,n2 form a basis of span(ni,n2). In this basis, the matrix of the 


restriction of (-,-) is 
ee age 


So By, = 1 and det B = 1 — (ni,n2)?. The equivalence of (ii) and (iii) follows from last lecture’s 
signature theorem (and the fact that the matrix of a non-degenerate bilinear form has non-zero 
determinant). 
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Now suppose the hyperbolic lines 1,,/2 intersect in x € H?, and let h; = {y € H? | (y,n;) > O} 
be half-planes bounded by 11,12. Since the Lorentz scalar product of R*:! is Euclidean on the 
subspace x+ = span(n1,n2), we can measure angles between vectors in z+ in the usual way. 


The exterior angle @ of the half-planes h,, hz at x is determined by 
cos & = (n1,N2). 
The interior angle a is 7 — G, so 
— cosa = (Ny, N2). 
In particular, J; and ly intersect orthogonally if (n,,n2) = 0. 


Remark. A hyperbolic rotation (of H) with center x € H? is a map T € O(2,1) with T(x) =x 
and which is a Euclidean rotation on «+. The exterior angle between the two half-spaces h1, hz 
is the angle of the hyperbolic rotation mapping one to the other. 


Proposition. (i) If x € H? and 1 is a hyperbolic line, then there is a unique hyperbolic line 
through x which intersects | orthogonally. 


(it) If l,, lo are two hyperbolic lines with unit normals n1,n2 such that |(n1,n2)| > 1 (so the lines 
do not intersect), then there is a unique line l3 intersecting both l, and lz orthogonally. 


Proof. Exercise. 


Hyperbolic triangles 


Let A, B,C € H? be three points in the hyperbolic plane. Assume that they do not all lie on one 
hyperbolic line (this is equivalent to assuming A, B, C to be linearly independent). The hyperbolic 
triangle with vertices A, B,C is the intersection of H? C R*! with the set of non-negative linear 
combinations 

{AA + uUB+vC| A, u,v € Rso}. 


The side lengths a = d(B,C), b = d(C, A), c= d(A, B) satisfy 


—cosha = (B,C), —cosh b = (C, A), —coshc = (A, B). 


Let A’, B’,C’ be the spacelike unit vectors such that the half-plane bounded by the line through 
B,C and containing A is 

ha = {a € H? | (A',x) > 0}, 
and analogously for B’ and C’. Then the hyperbolic triangle with vertices A,B,C is also the 
intersection h4; MN hg: Oho. The interior angles a, 3,y at A, B,C satisfy 


—cosa = (B’,C’), —cos 8 = (C’, A’), —cosy = (A’, B’). 
Theorem. The side lengths and interior angles of a hyperbolic triangle satisfy 


— cosh cosh b cosh 
cos a = . ze - 5 (hyperbolic side cosine theorem) 
sinh b sinh c 


cos a + cos 8 cos y 
sin 8 sin y ; 


cosha = (hyperbolic angle cosine theorem) 


Proof (sketch). This can be proved in the same way as we proved the spherical cosine theorems 
using the Gram matrices G,G’ of A, B,C and A’, B’,C’. Only now the scalar product is 


hh : _ (10 0 
(x,y) =a" Ey with B=(g1 0). 


So if V =(A BC) and W = (A’ B’ C’), then 


t —1 —coshe —coshb 7 t 1 —cosy —cos 8 
G=V'EV = (~coshe -1 cosha ), G = W'EW = (- 2084 1 = cosa), 
—coshb —cosha -1 —cos B —cosa 1 


and D = W'EV is a diagonal matrix with positive elements on the diagonal. Continue as in the 
spherical case ... 
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Remark. The spherical cosine theorems for a sphere of radius R (instead of 1) are 


= a b Cc \ 
_ COS # — COS F COS F a cosa@+cos cosy 
COS @ = hoe a ; cos = = ; - 
sin 7 sin 5 R sin @ siny 


One gets the hyperbolic cosine theorems by setting R =i. That’s why it is sometimes said that 
hyperbolic geometry is the geometry on a sphere with imaginary radius. 


From the hyperbolic cosine theorems, one can derive 


sinha = sinhb _ sinhc 


- = — (hyperbolic sine theorem) 

sina sin 8 sin y 
in the same way in which we derived the spherical sine theorem from the spherical cosine theo- 
rems. One can also derive hyperbolic versions of the half-angle and half-side theorems, and other 
formulas of spherical trigonometry. 


Theorem. (i) A hyperbolic triangle with side lengths a,b,c € Ryo exists if and only if the triangle 
inequalities are satisfied. (ii) A hyperbolic triangle with angles a, B,y € (0,7) exists if and only 


if 
at+B+y<n. 


Remark. We will see later that the area of a hyperbolic triangle is 7 — (a+ 6+4+7). 


Proof (sketch). 1. Show that a symmetric (3 x 3)-matrix G is the Gram matrix of 3 linearly 
independent vectors in R?! if and only if the bilinear form x‘Gy has signature (2, 1). 


. -1 —coshe —coshb 
2. Consider G = (- coshe -1 — cosh )) Because 
—coshb —cosha -1 
C4126: det (Gi? G2) = — sinh? c < 0, 


the signature of G is (2,1) if and only if det G < 0 (by the theorem from Lecture 6). 
3. Show the remarkable identity 


det G = —4sinh (=2+%+2) sinh (2=2+2) sinh (@#2=2) sinh (242+), 


and use it to prove part (i) of the theorem. 


1 — cosy —cos8 
Part (ii) can be shown in the same way by considering G’ = ( at oS cosa) and using the 
—cos B —cosa 1 
identity 
det G’ = —4 cos (=*5*) cos (=*) cos (45=*) cos (4+*). 
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The Klein model of the hyperbolic plane 


Project H? Cc R*! to the plane x3 = 1 with 0 € R*! 
as center of projection. This is central projection, 
because 0 is the center of the hyperboloid of which 
H? is one sheet. A point (22) € H? is mapped to 


+ ( 2 ), and all of H? is mapped to the inside of the 


x3 


unit circle in the image plane x3 = 1. Hyperbolic 
lines are mapped to secants of this unit circle. If we 
forget about the x3-coordinate, we get an image of 
the hyperbolic plane which is called the Klein model. 
Thus, the Klein model of the hyperbolic plane is the 


unit disk From W. P. Thurston, Three-dimensional geometry and Topology. 


D? = {ue R? | up +u3 < 1}, 


x 
where (#2) € H? corresponds to a (zs) € D?, and, inversely, (%}) € D? corresponds to 


: (2) € H?. Angles and lengths in are measured in H?. 


2 
ug \ 1 


1l-uy— 


For two hyperbolic lines 1,,l. with unit normals n;,n2 € R*"!, there are three possibilities: 


(1) [{mi, m2)| <1. (2) |(na,n2)| = 1. (3) [(ma,n2)| > 1. 

The lines intersect. The lines do not intersect and The lines do not intersect and 
their images in the Klein model their images in the Klein model 
intersect on the unit circle. intersect outside the unit circle. 


In cases 2 and 3 the lines do not intersect, thus they are parallel. However, to distinguish the 
two cases, parallel is sometimes used to mean only lines in case 2, and lines in case 3 are then 
called ultra-parallel. 


Angle of parallelism 


If 1 Cc H? is a hyperbolic line and x € H? is a point not on 
1, then there exist two parallels (in the narrow sense) to | 
through x and infinitely many ultra-parallels. The angle a 
between one of the parallels and the perpendicular through 
x is called the angle of parallelism. It depends only on the 
distance b from z to I. In fact, a = 2arctan e~? (exercise). 


More distance formulas 


Let « € H? and let 1 = {y € H? | (y,n) = 0} be a hyperbolic line. 


Proposition. The point on 1 that is closest to x is the intersection xp of | with the line |, through 
x that is perpendicular to I. 


Proof. The case x € | is trivial, so assume x ¢ I. Let y: R > 1 C H? be a parameterization 
of 1 with unit speed. So (y,7) = —1, (y,n) = 0, and (7,7) = 1. Let f(s) = d(z,7(s)) = 
arcosh(— (x, y(s))). Convince yourself that f(s) > oo as s + too. So f must attain a minimum, 
say at s = so. Then f’(so) = 0, and this implies (x, '(so)) = 0. Let x» = y(s0), Np = 7’ (So), 
and let I, = {y € H? | (y, np) = 0}. Then 

@ Zp is the point on | closest to x, 

e (¢,%,))=O0so 2 €1,. 

© (Zp,Np) = 0 because (y,7/) = 0, so rp E Ip, 
e (N,Np) = 0 because (7’,n) = 0, sol L Ip. 
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Proposition. The distance d(x,l) from x tol satisfies |(x,n)| = sinh d(z, 1). 


Proof. Let V = (x ap n np) and let E be the diagonal matrix with 1,1,—1 on the diagonal. 
Then 


ee ria hes ea -1 —coshd(xp,x) (a,n) 0 

_ t = Lp,L) \Lp,Lp) \Lp,N) (Lp Np es — cosh d(xp,x) -1 0 Oo 

O= det V'EV = det | (nie) (zp) (nin) (ming) = et ( wee 0 1 : 
(np,£) (Np, Lp) (Npn) (Np,Np) 0 0 Oo ol 


= 14 (x,n)? — cosh? d(ap, x) = — sinh? d(ap, 2) + (x,n)? 


and d(«,,x) = d(x,1) by the previous proposition. 


Of course the sign of (2,n) depends on whether or not « is contained in the half-plane (a, n) > 0. 
Proposition. The distance d(l;, lz) between two lines l; = {y € H? | (y, ni) = 0} with |(n1,n2)| >1 
satisfies |(n1,n2)| = cosh d(y, Iz). 

Proof (sketch). This can be proved in the same way as the previous proposition, but letting 


V = (« ni ng np), where 1, = {y € H? | (y,np) = 0} is the common perpendicular to 1,,l2, and 
4 ly M Ip. 


The sign of (n1,n2) depends on whether or not one of the half-planes (y,n;) > 0 contains the 
other. 


Summary. Let 71,22 € H?, let n,n1,n2 be unit spacelike vectors and let 1,1,,l2 be the cor- 
responding lines. The two sides of each line are marked + and — according to the sign of the 
scalar product of points in H? with the chosen normal on that side. 


(v1, 7) = sinh d(#1, 1) 
(21,2) = —coshd(a1, £2) (a2,n) = — sinh d(x2, l) 


(n1,N2) = cosh d(l;, la) (n1,n2) = cos & = — cosa 


] (n1,n2) = — cosh d(Iy, 12) 


Hyperbolic “trilaterals” 


Let us define a hyperbolic trilateral as a non-empty intersection of three half-planes, of which 
none is contained in another. If we only consider the generic cases where pairs of boundary lines 
intersect or are ultra-parallel then there are four types of trilaterals according to the number of 
vertices. In the figures, the common perpendiculars of ultra-parallel lines are drawn in yellow. 


The first case is the case of triangles. In the other cases, one can also derive trigonometric 
formulas in the same way as we did for triangles. Of particular interest are trilaterals of type (4), 
which correspond to right-angled hexagons. For those one obtains the cosine and sine theorems 


eae! cosh a + cosh b cosh c a sinha’ sinhb’ — sinhc’ 
= ni — = . 
“ sinh bsinhc sinha sinh b sinh c 
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Intersections of H? C R?:! with planes 


Any plane in R*! is determined by an equation of the form (z,v) = b with 
Y y 


v € R* \ {0} and b ER. There are three types of planes according to whether v SF 


is spacelike, timelike or lightlike. 

e If (v,v) < 0, we may assume that (v,v) = —1 and v3 > 0, so v € H?. A plane 
of this type which intersects H? is of the form (v,x) = —coshr. The intersection <——f{-4 
is therefore a circle with center v and radius r. i 


e If (v,v) > 0, we may assume that (v,v) = 1 and b > 0. A plane of this type 
always intersects H? and is of the form (v,2) =sinhr. The intersection with H? 
is therefore a curve of constant distance from the line (v,x) = 0. (For r = 0, it is ©— 
the line itself.) ; 


e If (v,v) =0, a non-empty intersection with H? is called a horocircle. 
The Poincaré disk model of the hyperbolic plane 


0 
Project H? to the plane 23 = 0 with e3 = ( °,) as center 


of projection. This is stereographic projection of H?. It maps 
H? to the unit disk of the plane x3 = 0. Analytically, it is the 
map o72 : H? > D?, 


(2) x 
XY 1 11 nee 
O H2 es = 23 7m i. (2) ’ 
1 2uy 
ont 2 = ——_.—_,; 2u2 
ug 1 — uy? — ue? 


1+ uy? + ue? 


Theorem. Stereographic projection of oy2 : H? — D? 
maps intersections of H? C R*! with planes in R* to 
intersections of D? with circles and lines. 

In particular, hyperbolic lines are mapped to intersections 
of D? with circles and lines that intersect the unit circle 


OD? orthogonally. 7 lines, 2 circles, 1 horocircle (left). 


This can be shown by a calculation like in the case of $7. Curves of constant distance form a 
line (right). 


Theorem. Consider two curves ¥, i): (—e,¢) + H? in the hyperbolic plane with 4(0) = 9(0) = p. 
Let y = oy2 °F and n = oy2 07% be their images in D? under stereographic projection, and let 
v= 7 (0), w= (0), Ul 7 (0), w= 7 (0). Then 

4 
(lpi? par)? 
where (-,+)p2,1 is the Lorentz scalar product of R?:! and (-,-)p2 is the standard Euclidean scalar 
product of R?. 


This, too, can be shown by a calculation like in the case of $?. 


(0, O)R21 = (v, w)R2 5 


Hence oy2 is conformal in the sense that curves in H? intersecting at 
some angle are mapped to curves inside the unit circle of the Euclidean 
plane intersecting at the same angle. 


One can measure hyperbolic lengths and angles directly in D? by using 
the variable scalar product 


4 
v,w) = VU, W) Re - * 
Ip( ) al —p2 — pp”)? ( dR ( ) 
For example, the hyperbolic length of a curve in H? C R*! which o72 glavbech 
maps to ¥: [t1,t2] — D? is be 
‘ihe 


/ * fon.) at. 
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In the image of H? under stereographic projection, lengths appear scaled down by the variable 
factor $(1 — pi? — po”). The image in D? of an object in the hyperbolic plane gets smaller and 
smaller as it moves towards the boundary circle 0D?. 


A Riemannian metric on an open set U C R” is a variable Euclidean scalar product. More 
precisely, it is a C° map g : U x R” x R" > R, (p,v,w) © gp(v,w), such that for each 
p €U, g,(-,-) is a Euclidean scalar product on R”. (Thus, g,(v,w) = v'G(p)w with a matrix 
G depending on p € U.) One can then measure lengths of curves and angles in U using the 
Riemannian metric, and this is called Riemannian geometry. 


A Riemannian metric g on U C R” is called conformal if 


Ip(u, v) = A(p) (u, VU) Rn 


for some function A: U + Ryo. If this is the case, angles measured using g are equal to the 
Euclidean angles measured using (-,-)Rn. 


Thus, gp as defined by equation (*) is a conformal Riemannian metric on D?. The unit disk D? 
with this Riemannian metric is called the Poincaré disk model of the hyperbolic plane. 


From Klein model to Poincaré disk model via the hemisphere model 


We have encountered two ways to map H? € R* to the unit disk. Central projection gives the 
Klein model and stereographic projection gives the Poincaré disk model. The composition 


central stereographic 


» H? » D? 


projection projection 


D? 


is a peculiar self-map of the unit disk D?, which maps secants of D? to circles orthogonal to the 
boundary 0D?. 


Proposition. The same map D? + D? is also the result of the 
following construction: First, project D? C R? C R® orthogo- 
nally down to the lower hemisphere of S? C R°. Then project 
stereographically back to D?. 


From Prasolov & Tikhomirov, 
Geometry. 
Proof. Let « = (=) € H?. Via central projection, this corresponds to a ee) in the Klein 
3 : 
model. Orthogonally projecting down to the lower hemisphere of S$? gives 
x1 /x3 Ly 1 Ly 


1 


£2/X3 = — x2 = — LQ 


(ia Be) Never ay tt 


Projecting this point back to the unit disk (by stereographic projection of $7) results in 


(x8) =e (Baa 
3 \ 1 lines rs wg3+1 \%2 ; 


which is the same as o772(2). 


The calculation in the proof shows that (=) > i( z ) maps H? to the lower hemisphere 
3 


£3 — 


of S?. Equally (=) K i (2) maps H? to the upper hemisphere of $?. These images of H? 
x3 x3 


are called the (lower and upper) hemisphere models. The hemisphere models are also conformal, 
and circles, horospheres, lines, and curves of constant distance from lines are represented by 
(parts of) circles in $?. 
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The Poincaré half-plane model 


One obtains the Poincaré half-plane model of H? by projecting 
the upper hemisphere model stereographically from a point on 


the equator, say from e, = { 0 ) to the plane x, = 0. This maps 
the equator (minus e;) to the x-axis and the upper hemisphere \es 
to the upper half-plane of the the x2, 73-plane which we identify <S 
with the upper half-plane = 
Hi = {( ua ) | U2 > 0 } Cc R?’. From W.P. Thurston, Three-Dimensional Ge- 


ometry and Topology. 


Since stereographic projection is conformal and maps circles to circles and lines, hyperbolic lines 
are represented in He by half circles meeting the u,-axis orthogonally and vertical lines. You 
can show by a direct calculation that hyperbolic lengths and angles can be measured in H? by 
using the Riemannian metric 


1 
Gp(v, w) = Up? (v, W)R2- 
The half-plane H?. with this Riemannian metric is called the Poincaré half-plane model of the 


hyperbolic plane. In H?, hyperbolic lengths appear scaled by the variable factor uz which is the 
Euclidean distance to the boundary. 


Two examples for length calculations in the half-plane model (and some remarks) 


sint 
cost 


a as il cost cos t 
/ / — 622 
| I(t) (7 (t), 7 (t)) dt = ‘i lz cos2 t (r (: sce C ) ae 


1 1, 1+sina a 
= =-] = (S 7). heck ; , 
[ cose 5? 1 sae og tan 5) + 7 (Check it out!) 


(1) Consider the curve ¥ : [0,a] + H?, y(t) =r ( ) . Its hyperbolic length is 


(2) Consider the curve 77: [t1,t2) > HZ, n(t) = @: Its hyperbolic length is 


[vow J" Fa((3) (a fr 


t 
= log te — logt; = log >. 
1 


Note that in the first example, the length does not depend on r, and in the second example, the 
length depends only on the quotient t2/t. In fact, scaling transformations (%j}) + A(%i}) (with 
> 0) of the upper half-plane represent isometries of the hyperbolic plane. For example, scaling 
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by the factor 2 makes all objects in the upper half-plane look twice as large. At the same time, 
all distances from the boundary also double. In effect, hyperbolic lengths stay the same. 


Horizontal translations ey) a (ae) of the upper half-plane also represent isometries of H?, 


and so do reflections on vertical lines. 


Calculating hyperbolic areas in the half-plane model 


If Euclidean lengths in the upper half-plane model have to be scaled by a variable factor of 1/ug 
to get the hyperbolic length, then Euclidean area has to be scaled by the factor (1/u2)?. So the 
hyperbolic area of a region R C H? is 


1 
area(R) = i —, dui dug. 
R U2 
Theorem. The area of a hyperbolic triangle with interior angles a, B,y is 
m-a—-B-y. 
Remark. In Lecture 8, I sketched a purely analytic proof for the fact 
that the angle sum in a hyperbolic triangle is always less than 7. A 


more visual argument can be made using the Poincaré disk model and 
moving one vertex of the triangle to the center of the disk. 


To prove the theorem, consider first a hyperbolic triangle 
T(a, 8,0) with one vertex “at infinity”. (Strictly speak- 
ing, this is not a triangle but what we called a trilateral.) 
The figure on the right shows such a triangle in the Klein 
model, in the Poincaré disk model, and in the Poincaré 
half-plane (where the infinite vertex was used to project 
from the hemisphere model). 


U2 


cos 6 ee) 1 
= € a) au) du, 
ui=cos(7—a) ug=V1—u12 U2 


cos 3 1 oo 
Lo Con)* 
ui =cos(7—a) U2 l-uyz 


cos B 1 
tg 
u1=cos(m—a@) 1-uy 


cos B 


1 
area(T (a, 6,0)) = | —z dui dug 
T(a,8,0) 


= [ — arccos wu] =T7-a-—f. 


cos(7—a) 


For 6 > co one obtains the area of a triangle with two vertices “at infinity”: 


area(T(a,0,0)) =7—a. 
Now we can calculate the area for a triangle T(a, 8,7) with angles a, 3,7: 


area(T(a, 6, y)) = area(T'(a, 0,0)) — area(T (61,71, 0)) — area(T (Be, 0,0) 
={9— a) = (9 = 1 = 71) — r= Pa) 
=n-—a—(7-fi — fa) -(n9-1) =7-a- 8-4. 


This proves the theorem. 
Concluding remarks 


(1) All the models we have discussed exist also for higher dimensional hyperbolic space. 


(2) We have defined hyperbolic space as one sheet of a hyperboloid and then derived the other 
models from it. Actually, any metric space isometric to our H” is called hyperbolic space and 
AH”. The hyperboloid is just a model like the others, called the hyperboloid model. 
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Projective Geometry 
Introduction 


Consider projecting a plane E to another 
plane E’ from a point P not on EF or E’. 


Every point in & has an image in E’ except 
points on the vanishing line of E, which is the 
intersection of F with the plane parallel to E’ 
through P. Every point in E’ has a preimage 
in E except points on the vanishing line of E’, 
which is the intersection of E’ with the plane a I 
parallel to E through P. 


The projection maps lines to lines. A family of parallel lines in FE is mapped to a family of lines 
in E’ which intersect in a point on the vanishing line. 


Idea: Introduce, in a addition to the ordinary points of E, new points which correspond to points 
on the vanishing line of E’. In the same way, introduce new points of E’ which are images of the 
vanishing line of E. These new points are called points at infinity, and the extended planes are 
called projective planes. The projection becomes a bijection between projective planes. Parallel 
lines in F intersect in a point at infinity. The points at infinity of F form a line called the line 
at infinity which corresponds to the vanishing line of E’. 


Drawing a floor tiled with square tiles 


Suppose you have already drawn the first tile. (I don’t want to go 
into the details of how one can construct the image of the first tile, 
even though that is interesting and not difficult.) The figure shows 
how the other tiles can then be constructed. 


Analytic treatment 
Suppose F is the 1129-plane, E’ is the x273-plane, and P = (7). A point A = (=) eB 


0 
is mapped a point A’ = (») € E’, and by solving A’ = P+ t(A — P) for t one finds that 
2 
Y= ot and yo = arene So in terms of the coordinates 71,22 of plane F and y1, yo of plane 
E", the projection is the function 


(G2) =f (22) = sca (24)- 
The vanishing line of EF is the line x; = —1, and the vanishing line of E’ is the line yp = 1. 
Introduce homogeneous coordinates: Instead of using two numbers (Be ) to describe a point in E, 


U1 . 
use three numbers (2) such that x1; = ae and rg = ae The homogeneous coordinates for a 
is 


x 
point are not unique: (= ) are homogeneous coordinates for the point (73), but for any A # 0, 
AL 
(a: ) are also homogeneous coordinates for the same point. In the same way, use homogeneous 
r 
Vv 
coordinates (2) with y; = {and yo = % to describe a point (#3) € E’. Let us write the 


: ey . : U1 5 
projection f in terms of homogeneous coordinates. Let (2) be homogeneous coordinates for 
3 


(2) ) and let (%) be homogeneous coordinates for ($3) = f(2}). Then 


U1 a= v2 _ us = U2 

hat ween” _ —_ ’ 

U3 ry+i1 ae +1 U1 + U3 

U2 = U1 

U3 on Uy + U3 : 

so we may choose 

U1 U2 0 1 0 Uy Uy 
v2 = U1 = 1 0 0 U2 =: f U2 
U3 U1 + U3 1 01 UZ UZ 
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Using homogeneous coordinates, the projection may thus be written as a linear map f :R? > R. 
Moreover, f is bijective! Points on the vanishing line x, = —1 of EF have homogeneous coordinates 


UU Aa Vv 
(% ) with u;+u3 = 0, and f maps these to vectors ( "2 ) , which are not homogeneous coordinates 


for any point in £’. But we can interpret them as homogeneous coordinates for a point at infinity 
of the extended plane. 


Thus, each non-zero vector u € R® represents a point in a projective plane (of which it is the 
vector of homogeneous coordinates) and two vectors u, u’ represent the same point if and only if 
u’ = du for some A # 0. 


Projective geometry 


Projective geometry deals with the properties of figures that remain 
unchanged under projections. An example for a theorem of projec- 
tive geometry is Pappus’ theorem. It talks only about points, lines, 
and the incidence relation between points and lines. 

We will see that a curve being a conic section is a projective prop- 
erty. But the distinction between circles, ellipses, parabolas and 
hyperbolas is not. The photograph shows a circle being projected 
to a parabola. 

Also the distinction between ordinary points and points at infinity 
is not a projective property, because as we have seen, a projection 
can map ordinary points to points at infinity and vice versa. So 
from the point of view of projective geometry, points at infinity of 
a projective plane are not distinguished from ordinary points and 
the line at infinity is a line like any other. 


Pappus’ theorem 


Basic definitions 


Let V be a vector space over a field F’. The projective space of V is the set P(V) of 1-dimensional 
subspaces of V. If the dimension of V is n+1, then the dimension of the projective space P(V) is 
n. A 1-dimensional projective space is called a projective line and a 2-dimensional one is called 
a projective plane. An element of P(V) (that is, a 1-dimensional subspace of V) is called a point 
of the projective space. 


Ifv € V\ {0}, then we write [v] := span v. So [v] is a point in P(V), and v is called a representative 
vector for this point. If A 4 0 then [Av] = [v] and Av another representative vector for the same 
point. 


Suppose we chose a basis v1,...,Un41 of V. This gives an identification of V with F"*! and of 
P(V) with P(F"*"). A vector v € V has a basis representation 
n+1 
v= ye LjVU5 
j=l 
and %1,...,%,41 € F are the coordinates of v with respect to the basis. These coordinates of 


v € V are the homogeneous coordinates of the point [v] € P(V). If A 4 0, then Aqy,...,Atn441 
are also homogeneous coordinates of [v]. 


Let U Cc P(V) be the subset of points for which a particular homogeneous coordinate, say 2n+41, 
does not vanish: U = {[v] € P(V) |v = Set ay, with tn41 #0}. Then the map U,+41 > F”, 


j=l 
uy @1/Ln41 ur 
uz £2/fn41 uz ig 
ve>]. f= . is a bijection with inverse _ fro y UjVj + Un41; 
Un @n/Ln41 Un jt+1 
and uj1,...,Un are called affine coordinates of v € Un41. 


We will mostly consider the case where the base field F' of the vector space is the field of real 
numbers R. In this case, the concepts of a point, line or curve, etc., have their intuitively 
geometric meaning. But many theorems of projective geometry hold for arbitrary base fields. In 
particular, when dealing with curves and surfaces defined by algebraic equations, it is natural to 
use the base field C. Finite fields are used in elliptic curve cryptography. 


One usually writes RP” for P(R"*) and CP” for P(C"*). More generally, if V is any real or 


complex vector space, then P(V) is called an RP” or CP”, respectively. 
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Basic examples 


The points of RP!, the one-dimensional real projective space or the 
real projective line are the 1-dimensional subspaces 


[23] =R() CR’, — where (33) £0. 


The points with homogeneous coordinate x2 4 0 are described by 
one affine coordinate x2;/a2 € R. On the other hand, all represen- 
tative vectors (“f ) € R?\ {0} represent the same point [}] € RP!. 
So one can think of RP! as R plus one additional point (which is 
reasonably denoted by oo). 

The points of RP?, the two-dimensional real projective space or the 
real projective plane are the 1-dimensional subspaces 


[#3] =R(#) CR, where (2) £0. 


x3 
The points with homogeneous coordinate x3 4 0 are described by two affine coordinates 11/23, 
x2/x3. On the other hand, the points [=| form the 1-dimensional real projective space P(U), 


where U C R® is the subspace U = {(#) € R°}. So RP? can be thought of as R? plus a 
projective line. 


In general one can think of RP” as R” plus an additional RP". 
In the same way, the complex projective line CP! is C plus one additional point oo. In complex 


analysis, CP! = C U {oo} is called the extended complex plane and denoted by C. 


Projective subspaces 


A projective subspace of the projective space P(V) is a projective space P(U), where U is a vector 
subspace of V. If k is the dimension of P(U) (that is, k +1 is the dimension of U), then P(U) 
is called a k-plane in P(V). In particular, for k = 1 it is called a line, for k = 2 a plane and for 
k =n—1 a hyperplane in P(V). 


Exercise. How many points are there in the projective plane P(Z2*)? How many lines? How 
many points does each line contain? How many lines pass through each point? 


Proposition. Through any two distinct points in a projective space there passes a projective line. 
Proposition. Two distinct lines in a projective plane intersect in a unique point. 
(Proofs by linear algebra.) 


In general, if P(U,) and P(U2) are two projective subspaces of P(V), then the intersection 
P(U,) N P(U2) is the projective subspace P(U; M U2). The projective span or join of P(U,) and 
P(U2) is the projective subspace P(U, + U2). 


Exercise. Show that a point is in the join of two projective subspaces if and only if it is on a 
line joining a point in one of the subspaces with a different point in the other. (Actually, there 
one a very degenerate case in which this is not true.) 


From the dimension formula of linear algebra, 
dim(U, + U2) = dim U; + dim U2 — dim(U, M U2), 
one obtains 


dim(P(U, + Uz)) = dim P(U;) + dim P(U2) — dim(P(U;) N P(Us)). 
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Desargues’ theorem 


Theorem (Desargues). Let A, A’, B, B’,C,C’ be points of a projec- 
tive plane such that the lines AA’, BB’ and CC’ intersect in one 
point P. 


Then the intersection points C"” = AA'Q BB’, A” = BB'NCC' 
and B" =CC'N AA’ lie on one line. 


If this theorem is considered in an affine plane instead of a projec- 
tive plane, it breaks up into several special cases which have to be 


considered separately. For example: Under the same conditions, if 
AB || A'B' and BC || B’C’ then CA || C'A’. 


Some preparation is necessary before the following proof. Let P(V) be an n-dimensional projec- 
tive space. Then n+ 2 points in P(V) are said to be in general position if one (hence both) of 
the following equivalent conditions are satisfied: 


(i) No n+1 of the points are contained in an (n — 1)-dimensional projective subspace. 
(ii) Any n+ 1 of the points have linearly independent representative vectors. 


So three points on a line are in general position if they are distinct, four points in a plane are 
in general position if no three of them lie on a line, and five points in a 3-dimensional projective 
space are in general position if no four of them lie in a plane. 


Lemma. Let P(V) be an n-dimensional projective space and suppose P;,...Pn42 € P(V) are 
in general position. Then representative vectors v1,...,Un+1 € V may be chosen so that 


Uy + U2 18+ + Unt = Un+2: 
This choice is unique up to a common factor. That is, if 01,...,Un41 ts another choice of 


representative vectors with 0, + 02 +++: +%n41 = Onto, then tp = rAv~K for some A # 0. 


Proof of the lemma. Let w1,...,Wn+2 be any representative vectors for the points P,,...Phy+2. 
They are linearly dependent because dimV =n+1. So 


nt+2 


) ajwj = 0 
j=l 


for some a; which are not all zero. In fact, no az can be zero, because that would mean that 
there are n + 1 among the w; which are linearly dependent. Hence we may choose 


Ui = 41U1, V2 =aAQW2, +--+. Un4+1 = 4n41Wn41, Un+2 = —4n41Wn+4+1- 


To see the uniqueness claim, suppose A101,...,An42Un+42 is another choice of representative 
vectors with ee AKvE —An+2Un+2 = 0. This amounts to a system of equations of rank n+ 1 for 
the n+2 variables \;,. So the solution space is 1-dimensional and hence A; = Ag =... = An+2.- 


Proof of Desargues’ theorem. If A, A’, P are not distinct the statement of the theorem is obvious. 
(Check this.) So we may assume that A, A’, P are distinct and also B, B’, P and C,C’, P. But 
then A, A’, P are three points on a line in general position. So by the lemma we may choose 
representative vectors a,a’,p € V with a+a’ = p. For the same reason we may also choose 
representative vectors b, b’ and c,c’ so that b+ b' =p andc+c' =p. Then 


ata =b4+0 =c4+d. 


This implies a — b = b’ — a’. Obviously, the vector a — b = b’ — a’ is in the span of a and b and 
also in the span of a’ and b’. So the point [a — 6] = [b’ — a’] € P(V) lies on the line AB and on 
the line A’B’, hence it is the point of intersection, C”. Similarly, A” = [b—c] = [c — 6’), and 
B" = [c-— a] =|a’ — Cc]. But 


(a—b) + (b—e) + (c—a@) =0, 


which means that vectors (a — 6), (b — c),(c — a) are linearly dependent and so they span a 
subspace of dimension at most 2. Therefore, C’”, A” and B” lie on a line. 
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Desargues’ theorem says: 


If the lines joining corresponding points of two triangles meet 
in one point, then the intersections of corresponding sides lie 
on one line. 


The converse is also true: 


If the intersections of corresponding sides of two triangles lie 
on one line, then the lines joining corresponding points meet in 
one point. 


Surprisingly, the converse statement is in fact equivalent to the 
original statement after a permutation of the point labels (see 
figure right). 


The Desargues configuration turns out to be very symmetric 
in the sense that there are many permutations of the labels 
A, B,C, A’, B’,C’, A”, B”,C” and P preserving the relevant in- 
cidences. 


Desargues’ theorem also holds for triangles in two different 
planes of a 3-dimensional projective space P(V). In this case, 
it can be proved without any calculations: The intersection 
points of corresponding sides lie on the line in which the planes 
of the two triangles intersect. 


The planar version of Desargues’ theorem can also be proved without any calculations if the 
third dimension is used: 


Proof (a 3d proof of Desargues’ theorem). Let EF’ be the plane of the two triangles ABC, A’ B’C’ 
and the point P. Choose a line through P which is not in F and two points X and Y on it. The 
lines XA and Y A’ lie in one plane, so they intersect in a point A. Similarly, let B= XBNYB' 
and C = XCN YC’. Now the intersection of the line AB and the plane E lies on the line AB, 
because the plane XAB intersects E in AB. Similarly, ABN E also lies on the line A’B’, so 
ABN E =C". In the same way, BON E = A" and CAN E = B". Hence A”, B” and C” lie on 


the line where F intersects the plane ABC. 
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The preceding proof also suggests the following 3-dimensional way to generate any planar De- 
sargues configuration. This construction also reflects the high degree of combinatorial symmetry 
of the configuration. Let P,, Po, P3,.P,, Ps be five points in general position in a 3-dimensional 
projective space, and let FE be a plane that contains none of these points. Let J;; = 1;; be the 10 
lines joining P; and P; (i # 7). The 10 points P;; where these lines intersect E form a Desargues 
configuration. If (7, j,k,r,s) is any permutation of (1,2,3,4,5), then the points P;;, Pjx, Pri al- 
ways lie on a line (the intersection of the plane P;P;P;, with E), which we denote by gs. Any 
one of the points P,,, lies on the line gz, if the four indices uvry are different. So there are three 
lines through each point and three points on each line. Corresponding points of the triangles 
Pir, Pir, Per and Pis, Pjs, Pes are joined by the lines gx, gxi, 913, which all pass trough P,,. The 
intersection points of corresponding sides all lie on the line g,... 


The same Desargues figure contains therefore 5-4/2 = 10 pairs of triangles satisfying the condition 


of Desargues’ theorem. 


Pappus’ theorem 


Theorem (Pappus). Let A, B, C be points on one line in a 
projective plane P(V), and let A’, B’, C’ be points on another 
line. Then the points 


C" = AB'NA'B, AN =BC'NB'C, BY =CANCA 


lie on a line. 


Proof. At most one of the points A’, B’,C’ may lie on the line through A,B,C, so we may 
assume without loss of generality that A, B, B’,C’ are in general position. Then we may choose 
representative vectors a,b,b/,c! € V witha+b+0' =c’. Now a,b,b' form a basis for V in which 


the homogeneous coordinate vectors for A, B, B’,C’ are (3) , (:) : (8), ( 1 ) . SO we may assume 


without loss of generality that 


Then 


for some c and a. Now 


at ={[H]|s=0} ant ata ={|o(}) +4 


oes ee ioe cai ) +o 
sala 
) 


(where B” is obtained by a quick calculation on the side). Now 


(8) +6(2)- (ED <0 


so the vectors are linearly dependent and hence A”, B’’, C” lie on a line. 
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The synthetic approach to projective geometry 


We have defined a projective space P(V) of a vector space V over a field as the set of 1-dimensional 
subspaces of V. Of course this definition is based on basic axioms of algebra: the field axioms and 
the vector space axioms. This section provides a rough outline of how the theory of projective 
spaces P(V) can be based on geometric axioms. 


The following definition of a projective space in terms of geometric axioms (due to Oswald Veblen 
& John W. Young, 1908) is not equivalent to our definition of a projective space of a vector space 
over a field. (One obtains an equivalent definition if Pappus’ theorem is added as an independent 
axiom; see the structure theorem below.) 


A projective space P = (P,L) is a set P, the elements of which are called points, together with 
a set £ of subsets of P, which are called lines, such that the following axioms are satisfied. 


AXIOM 1. For any two distinct points there exists a unique line which contains both points. 

AxIomM 2. If A,B,C € P are three distinct points and | € CL is a line that intersects the lines 
AB and AC in distinct points, then I intersects the line BC. 

AXIOM 3. Every line contains at least three points. 


Axiom 1 implies that two lines intersect in at most one point. Axiom 2 is a clever way of saying 
that two lines in one plane always intersect without first defining what a plane is. 


A projective subspace of P is a subset U C P of points such that the line through any two points 
of U is contained in U/. Together with the subset of lines {1 €¢ £|1 C U}, the subspace is a 
projective space in its own right. The intersection of two projective subspaces is a projective 
subspace. 


If S C P is any set of points, then the projective span of S is the smallest projective subspace 
containing S, or equivalently, the intersection of all projective subspaces which contain S. 


The dimension of the projective space P is the smallest number n for which there exist n + 1 
points P,,..., P41 € P such that P is the projective span of {Pi,..., Pris}. 


The Axioms 1-3 together with the assertion that the dimension of P is 2 are equivalent to the 
following axioms for a projective plane. (Can you prove this equivalence? It is a little tricky.) 


AXIOM P1. Same as Axiom 1. 

AXIOM P2. Any two lines have non-empty intersection. 
AXIOM P3. Same as Axiom 3. 

Axiom P4. There are at least two different lines. 


If the dimension of P is at least 3, then Desargues’ theorem can be deduced from Axioms 1-3. 
The 3D proof of the last lecture works in this setting, it uses only the incidence relations between 
points, lines and planes, and does not involve any calculations. However, there are projective 
planes in which Desargues’ theorem does not hold. The purely 2-dimensional proof does not 
work here because it is based on calculations. 


A projective plane in which Desargues’ theorem holds is called a Desarguesian plane. 


Theorem (Veblen & Young). Any projective space in which Desargues’ theorem holds (that is, 
any projective space of dimension > 3 and any Desarguesian plane) is isomorphic to a projective 
space P(V) of a vector space V over a skew field F. If Pappus’ theorem also holds in P, then F 
is a field. 


(Two projective spaces are isomorphic if there is a bijection between their points that maps 
lines to lines. A skew field satisfies all field axioms except that the multiplication may not be 
commutative. You may check that our computational proof of Pappus’ theorem does not work 
if multiplication is not commutative.) 


In a projective plane, Desargues’ theorem can be deduced from Pappus’ theorem. (This was 
demonstrated by Hessenberg in 1905. It is not obvious at all.) 


Thus, any theorem that holds in any projective space P(V) of a vector space V over a filed can 
also be deduced from Axioms 1-3 together with Pappus’ theorem as independent axiom, and 
vice versa. Further axioms of order and of continuity have to be added to single out the real 
projective spaces RP” (just like further axioms have to be added to the general field axioms to 
single out field of reals). 
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Projective transformations 


Let V,W be (n+ 1)-dimensional vector spaces, and let f : V — W be an invertible linear map. 
Since f maps any 1-dimensional subspace [v] C V to a 1-dimensional subspace [f(v)] C W, it 
defines an invertible map P(V) — P(W). A map between projective spaces which arises in this 
way is called a projective transformation: 


A map f : P(V) > P(W) is a projective transformation if there is an invertible linear map 
f:V—W such that f([{v]) = [f(v)]. 


A projective transformation maps lines in P(V) to lines in P(W) and generally k-planes to 
k-planes. 


In homogeneous coordinates, a projective transformation is represented by matrix multiplication: 


A point in P(V) with homogeneous coordinates « = ( ) is mapped to the point in P(W) 


En+1 
with homogeneous coordinates y = Ax for some invertible (n + 1) x (1 +1) matrix A. In affine 
coordinates uj = %;/@n41, Wi = Yi/Ynqi (¢ = 1,...,n) the map is a so-called fractional linear 
transformation: 


n 
doja1 Gig ty + in41 


a : 
jai An41 jj + An+in41 


Wi= 
Each w; is the quotient of two affine linear functions of the u;, where the denominator is the 
same for all 7. 


Proposition. Two invertible linear maps f,g: V — W give rise to the same projective trans- 
formation P(V) > P(W) if and only if g = Af for some scalar \ #0. 


Proof. “=”: If g = Af, then [g(v)] = [Af(v)] = [f(v)]. 


“=5”: Suppose [g(v)] = [f(v)] for all v €e V\ {0}. This implies g(v) = A(v) f(v) for some non-zero 
scalar \(v) which may a priori depend on v. We have to show that it does not. So suppose 
v,w € V \ {0}. If v,w are linearly dependent, then it is obvious from the definition of A(v) that 
A(v) = A(w). So assume v, w are linearly independent. Now 


gv + w) = gv) + gw) = Av) fv) + Aw) Fw) 


but also 
gu +w) =Avt+w)futw) =Aw+ w)(F(r) + f(w)). 


Since f(v) and f(w) are also linearly independent this implies A(v) = A(v + w) = A(w). 


The projective transformations P(V) + P(V) form a group called the projective linear group 
PGL(V). It is the quotient of the general linear group GL(V) of invertible linear maps V > V 
by the normal subgroup of non-zero multiples of the identity: PGL(V) = GL(V)/{AT} ago. 


Example 1. Affine transformations 


Consider an affine transformation R” > R",u Mu+b with M € GL(n,R) and b € R”. In 


homogeneous coordinates 21,...,% 41 with uj = Lae this map can be written 

Ty TH 

: M b : 

—_ : 
qn In 

Tn+1 0 0 1 En+1 
SS 
A 


The affine map «+ Mz +6 can be extended to the projective transformation 
RP” > RP”, [a] 4 [Ag]. 


It maps the plane at infinity 7,4, = 0 to the plane at infinity. Conversely, any projective 
transformation which maps the plane x,4; = 0 to itself corresponds in the affine coordinates u 
to an affine map R” > R”. 
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Example 2. Central projection 


Let E; = P(U,) and E, = P(U2) be two hyperplanes in a projective space P(V) (for example, 
two lines in a projective plane) and let W € P(V) be a point not in FE, or Ey. Then the central 
projection from E, to Ez with center W is the map f : £, > E2 that maps a point A € EF, to 
the intersection of E2 with the line through W and A. 

Proposition. The central projection ti is a projective transformation Ey > Ez. 

Proof. We have to show that f comes from an invertible linear map f : U; — U2. Note that W, 
as point in P(V), is a 1-dimensional subspace of V. Since it does not lie in E2, WM U2 = {0}. 
This means that V is the direct sum 


V=W 6th, 

and there are two linear maps pw : V > W and py, : V > U2 (the projections onto W and 
Uz) such that for any v € V, pw(v) and py,(v) are the unique vectors in W and U2 such that 
v = pw(v) + pu, (v). 

Claim: The central projection i comes from the linear map py,|y,, the restriction of py, to U4. 
To see this, let a € U, be a representative vector of A € E,. Then py,(a) 4 0, because py, (a) = 0 
would mean a € W, but U; 7 W = {0} because by assumption FE; does not contain W. This 
shows that py,|v, is invertible, because it an injective linear map U, > U2 and dim U, = dim Up. 
Now py, (a) € Us, so [py,(a)] € Ez. Also a = pw(a) + pu, (a), or 

PU2 (a) =a— pw(a), 


so pu, (a) € [a] + W, which means that [py,(a)] is in the (projective) line through A € P(V) and 
W € P(V). Hence [py,(a)] is the intersection of E2 with the line through W and A, so it is the 
image of A under the central projection. 


One can consider also more general types of projections. For example let J, and lz be two lines 
in a 3-dimensional projective space P(V), and let Ip be a line that does not intersect 1; or Io. 
Then the projection J; — Ig with the line Jp as center of projection is defined as follows: A point 
A € |, is mapped to the intersection of l2 with the plane spanned by Jp and A. This map J, > I» 
is also a projective transformation, and the proof is the same (apart from obvious modifications). 


Most generally, in an n-dimensional projective space P(V), one can project one k-plane FE to 
another k-plane E2 from any (n — k — 1)-plane Ec which does not intersect E, or E2 as center 
of projection; and this is a projective transformation EF, — E%. 


Theorem. Let P(V) and P(W) be two n-dimensional projective spaces and suppose 
Aj,..-,Any2 € P(V) and B,,...,Bnsa € P(W) 

are in general position. Then there exists a unique projective transformation f : P(V) + P(W) 

with f(A;) = B; fori=1,...,n4+2. 


Proof. Existence: By the lemma of Lecture 13, we may choose representative vectors a1, ...@n+2 
for Ai, seey An+2 and b1, seey br+2 for By, seey Bni2 such that eal ay = An+2 and ee b; = 
bn42. Also by the general position assumption, a1,...,@n41 and b1,...,6,41 are bases of V 
and W, respectively. Hence there is an invertible linear map f : V > W with f(a;) = }; for 
i=1,...,n+1. But then also 


(ato Da fa a es: 


So f maps the 1-dimensional subspaces A; = [a;] C V to B; = [b;] C W fori =1,...,n +2. 


Uniqueness: Let g : V — W be another invertible linear map with g(a;) € B; fori =1,...,n+2. 
Then 6; = g(a;) would be another set of representative vectors for the B; with 


ue Gane) =9 5. a= Oa eT be 


By the uniqueness part of the lemma from Lecture 13, this implies b; = Ab; for some » # 0, so 
g =Af, and g and f induce the same projective transformation P(V) + P(W). 


3l 


Boris Springborn Geometry I Lecture 16 Winter Semester 07/08 


Theorem. Let 1,,l2 be two different lines in a projective plane P(V). A projective transformation 
1, — lg is a central projection if and only if it maps the intersection 1, M Ig to itself. Otherwise 
it is the composition of two central projections 1, > | > ly. 


Proof. 


One can classify projective transformations f : P(V) — P(V) according to the normal forms 
of the corresponding linear maps f : V > V. Note that fixed points of f correspond to 1- 
dimensional eigenspaces of f. Let us consider a projective transformation f : RP! > RP! 
corresponding to the linear map f : R? + R?. The following cases are possible: 


1. f has two distinct real eigenvalues. Then it has two linearly independent eigenvectors. So the 
projective transformation f has two fixed points. The eigenvectors of f form a basis of R? in 
which f has the matrix (i - ). In the affine coordinate u of RP! corresponding to this basis, 


the fixed points are 0 and oo and the projective map f is the scaling transformation u tH Shu. 
In other affine coordinates, [ looks like the projected image of a scaling transformation. 


2. f has two linearly independent eigenvectors to the same eigenvalue. Then f is a multiple of 
the identity and f is the identity. 


3. f has a double eigenvalue but only one linearly independent eigenvector, so f has only 
one fixed point. Then there is a basis in which the matrix of f has the Jordan normal form 
(24) =A(§ 14>). In the affine coordinate u corresponding to this basis, the fixed point is 00 


and f is the translation uty u+ t: In another chice of affine coordinate, f looks like a projected 
image of a translation. 


4. f has a pair of complex conjugate eigenvalues. Then there is a basis of R? in which the matrix 
of f has the form (¢~—°) = Va? + 6?(O8? “S"*) for some y, and we may forget about the 


b siny cosy 
factor Va? + b?. So f comes from a linear map which looks in some basis of R? like a rotation. 
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A proof of Pappus’ theorem using central projections 


Let X = ACN C” A”. We have to show 
X = B”. Consider the sequence of central 
projections 


A?’ 43123 BO Sac’, 


where the centers of the projections are 
written above the arrows. They map 


Ar>Ar>oR road 
Qro>BroB req 
Beso AN SX 
Coro PRO O ROC. 


So the composition is a projective transformation AC’ — AC’ which fixes the three points 
A, Q,C", so it must be the identity. It also maps B” to X, so B”’ = X. 


The cross ratio 


The cross ratio of four distinct points P; = [4] € RP! (¢=1,...,4) is the number 


= det (it 2 ) det Ca 8 (x1y2 — fayi)(@3y4 — Lays) 


cr(P,, Po, P3, Py) := a n= : 
Pas Pas Pos Pa) = aos (aaa) det (3251) — (Gas — eaya)(eatn — 2144) 


It is well defined because the expressions on the right hand side do not depend on the choice of 
representative vectors (x;, y;) but only on the points P;. 


To derive an expression for the cross ratio in terms of the affine coordinate u = we assume at 


first that no y; is 0 so that no u; is oo: 


yaiye (Sr — 32 )ysya(S2 — F4) (ur — ua) (ug — ua) 


yoys (22 — 2) yay (St — 2) (ua — uz) (ua — ur) 


cr(P,, Po, P3, Ps) = 


=: cr(U1, U2, Us, Ud). 


Even if one of the u; is oo, one gets correct results using this formula if one “cancels infinities” . 
For example, if y; = 0 so that u; = oo, one has 


cr(P,, Py, Pa, Py) = —Ui¥2@aus = @ays) wiyoyswa(ts — Ge) us — Us 
1; Pe 4) = = - 
a (ways — @3ys)(—21y4) —-yoys(—arya) (22 — 22) U2 — U3 


so the following calculation gives the correct result: 
=i 
(00TH )(ug — ua) U3 — U4 
cr(00, U2, Ug, Ua) = = 


(uz — U3) (U4—CO), U2 — U3 


Proposition. (i) If f : RP! + RP! is a projective transformation and f(P;) = Q;, then 
er(Pi, Py, Ps, P4) = er(Q1, Qo, Qs, Qa). 


(ii) If v,w is any basis of R? _and Xj, 4; are homogeneous coordinates for P; in this basis (that is, 
P= [Ev + gw) and u = i is the corresponding affine coordinate, then one may just as well 
use these coordinates to compute the cross-ratio: 
E192 — Lap) (F3Ha — Lay ti — U2) (tig — tt 
cr(P;, Po, P3, Pr) _ (192 bata) (Gaia tas) _ (ir lia) (tis ua) 
(G23 — Z3Y4)(Zayi — Z1y4) — (a — tts) (Wa — U1) 


All this works not only for the real projective line RP! but also for the complex projective line 
CP! and any other projective space P(V) of a 2-dimensional vector space V over any field. 
If v, w is a basis of V, then the cross ratio of four points P; with homogeneous coordinates + 
Z;,Y; in this basis is defined defined by the equation above, and this is independent of the 
choice of basis. 


.—= 
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Proposition. The cross-ratio cr(P,, P2, P3, P,) is the affine coordinate of the image of P, under 
the projective transformation that maps P2, P3, Py to the points with affine coordinates 0,1, 0. 


Corollary. The cross ratio of four distinct points can take all values except 0,1, 00. 


Proposition. There exists a projective transformation that maps four distinct points P,, Pz, Ps, P4 
of a line to four distinct points Q1,Q2,Q3,Q4 on the same or another line if and only if 


cr(P,, Po, P3, Ps) = cr(Qi, Q2, Q3, Qa). 


The cross ratio depends on the order of the points. How does it change if the points are permuted? 
e The cross ratio does not change if I simultaneously interchange two of the points and the 
remaining two: 


cr(uz, U2, Ug, U4) = Cr(U2, U1, U4, UZ) = Cr(UZ, U4, U1, U2) = Cr(Ua, UZ, U2, U1). 


This is easy to see from the equation for the cross ratio in terms of the u;. 


e Of the 24 permutations of uz, u2, ug, ua, I need therefore only consider the six which fix u; and 
permute Ug, U3, U4. 


e If2,7,k,l is a permutation of 1, 2,3,4, then 
cr(uz, U2, Ug, U4) = Cr(V1, V2, 03,04) <=> cr(uj, Uj, UR, UL) = cr (U4, Uj, Ves U1): 


Indeed, there is a projective transformation that maps uj, U2, U3, U4 tO V1, V2, V3, V4 if and only 
if there is one that maps wi, uj, Uk, UW tO Ui, V7, Uk, W- (It’s the same map in both cases.) 


e Hence if cr(wy, v2, U3, Us) = ¢ = cr(g, 0,1, 00), then 


cr(u1, U3, U2, U4) = er(q, 1,0,00) = 4 — wo =1-4q, 
ee eee 
cr(u1, Ua, U2, U3) = cr(g, oo, 1,0) = fo — ~ = - 
cr(u1, Ug, U4, U2) = cr(g, 1, 00,0) = a =! 7 : =1 - 
tte) = OD = ORI = Tae 
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Projective involutions of the real projective line 


For any four points A, B,C,D € RP! there is a unique projective transformation f : RP! —> RP! 
with f(A) = B, f(B) =A, f(C) = D, f(D) = C, because cr(A, B,C, D) = cr(B, A, D,C). The 
transformation f is an involution, that is, f 4 identity but fo f = identity. 


A pair of points {A,B} C RP! separates another pair {C,D} C RP! if C and D are in different 
connected components of RP! \ {A, B}. 


{A, B} separates {C, D} => cr(A,C, B,D) <0. 


The involution f has no fixed points if {A, B} separates {C, D}, otherwise it has two fixed points. 
If f has two fixed points P and Q, then for all X € RP, cr(X, P, f(X),Q) =-1. 

For any two points P,Q € RP! there is a unique projective involution of RP! that fixes P and Q. 
If A,B, P,Q are four points in RP!, then one says the pair {A,B} separates the pair {P,Q} 
harmonically, if cr(A, P, B,Q) = —-1. 


The complete quadrilateral 


Theorem (on the complete quadrilateral). 

Let A,B,C,D € RP? be four points in general posi- 
tion, let P= ABNCD, Q = ADN BC, = PQ, 
X=(N BD, Y =lNAC. Then the pair of points 
{P,Q} on @ separates the pair {X,Y} harmonically: 


er(P,X,Q,Y) =-1. 


Here are two proofs for this theorem, one computa- 
tional, one using a projective involution of the plane. 


Proof (by computation). Since the points are in general position, there is a projective transfor- 
mation of RP? that maps A, B,C, D to 


ie Wed Tesle ash 


If the theorem holds for the projected figure, it holds also for the original one. It is thus enough 
to verify the theorem for 


In this case, @ is the line x3 = 0, and P = [3]. Q= [3] X= [=]. Y= [3]. Homogeneous 
coordinates on @ are obtained by dropping the x3-coordinate (which is zero), so 


det (§41)det (24) 
det(H Pat (hs) 


er(P, X, Q, Y) — 
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Proof (using a projective involution of RP?). Since A, B,C, D are in general position, there is 
a projective transformation of the plane RP? that maps At> B, Bt A, CHD, DOC. It 
is an involution of RP? which maps the lines AB and CD onto themselves. It maps the line 
AD to BC and vice versa. Hence, the points P and Q are fixed, and the line @ is mapped to 
itself. Since the line AC is mapped onto BD and vice versa, X is mapped to Y and Y to X. 
Thus, the restriction to @ is an involution of @ with fixed points P,Q and interchanging X,Y. So 
er(P, X,Q,Y) = —-1. 


Projective involutions of the real projective plane 


Suppose f : RP? + RP? is a projective involution of 
the real projective plane. Let A € RP? be a point 
which is not a fixed point, and A’ = f(A). Then also 
A= f(A’) and hence the line AA’ is mapped to itself. 
Let B € RP? be a point not on this line which is also 
not a fixed point, and B’ = f(B). Then the line BB’ 
is also mapped to itself, so P = AA’M BB’ is a fixed 
point of f. 


The restriction of f to the line AA’ is an involution of AA’ with a fixed point P, so it has another 
fixed point Q, and this is the point such that {P,Q} separates {A, A’} harmonically. Equally, 
the restriction of f to the line BB’ is an involution of BB’ with fixed points P and R such that 
{P, R} separates {B, B’} harmonically. Now f fixes every point on the line = QR. (Can you 
see why?) Thus: 


Any projective involution of RP? has a whole line of fixed points and another fixed point P ¢ @. 


Conversely, if @ is a line in RP? and P is a point not on @, then there is a unique projective 
involution f that fixes P and every point on ¢. This is the projective reflection on ¢ and P. 


Indeed if X,Y are any two points on @, and any representative vectors of P, X,Y are chosen as 
-100 
basis of R°, then the matrix of f in this basis must be ( 9 10). 


(What does this reflection look like in an affine chart in which @ is the line at infinity? What 
does it look like if P is a point at infinity?) 


Happy holidays! 


artwork: Franz Pedit creative directors: Ilya & Lysander 
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The fundamental theorem of real projective geometry 


We know that projective transformations map lines to lines. For real projective spaces they are 
in fact all transformations that map lines to lines: 


Fundamental theorem of real projective geometry. If a bijective map RP” > RP” (n > 1) 
maps lines to lines, then it is a projective transformation. 


Remarks. e Note that it is not necessary to assume that the map is continuous. 
e A map RP” + RP” maps lines to lines if and only if it maps k-planes to k-planes. (Why?) 
e The corresponding statement for CP” is false. For example, the map CP” > CP”, 


41 21 


en+1 Pe | 


is bijective and maps lines to lines, but it is not a projective transformation. The fundamental 
theorem for general projective spaces P(V) of a vector space V over a field F' says the following: 
If f : P(V) > P(V) is bijective and maps lines to lines, then f comes from an almost linear map 
yp:V—>V. A map y: V > W between vector spaces over F’ is called almost linear if for all 
uve V,AEF, 

p(u+v)=plu)+y(v) and — v(Av) = a(A)p(v), 
where a: F - F is a field automorphism. (For example, complex conjugation is an automor- 
phism of C. The field R of real numbers has no automorphism except the identity.) 
Corollary. A bijective map f : RR" > R” which maps lines to lines is an affine transformation 
f(x) = Ax +b for some AE GL(n,R),b € R”. 
(Because any such map R” — R” can be extended to a bijective map RP” > RP” which maps 
lines to lines and the hyperplane at infinity to itself. (How’)) 


For simplicity, I will present a proof of the fundamental theorem only for the case n = 2 of the 
real projective plane. This already contains all the important ideas, so you can figure out for 
yourself how it works for n > 2. The proof depends on the following two lemmas. 


Lemma. Suppose a map f : RP? — RP? is bijective and maps lines to lines. Then if A,B,C, D 
are four points on a line in RP? and cr(A, B,C, D) = —1, then cr(f(A), f(B), f(C), f(C)) = -1 


as well. 


Proof. Use the theorem on the complete quadrilateral. 


Lemma. Suppose a map f : RP! — RP! of the line has the property that if A,B,C,D € RP! 
are four points with cr(A, B,C, D) = —1, then also cr( f(A), f(B), f(C), f(D)) = —-1. Then f is 


a projective transformation. 


Proof. We will show that if f also fixes 0, 1, and ov, it must be the identity. This implies the 
lemma: For general f let g be the projective transformation that maps f(0) > 0, f(1) > 1 
f(co) 4 oo. Then the composition go f satisfies the assumptions of the theorem and fixes 
0, 1,00. If it is the identity, then f = g~+ is a projective transformation. 


So assume in addition that f fixes 0, 1, oo. Then for all 7, y € R: 


(1) f(2#t) = LOH) | because er(x, 2, y, 00) = -1. 

(2) f Onyx = 2f(x), because cr(0, x, 2x7 cools -1. 

(3) f(~+y) = f(x) + fly). This follows from (1) and (2). 

(4) f(—a) = —f(@) because 0 = f(0) = f(a + (—#)) = f(x) + f(-2). 

(5) f(nx) =nf (a) for n € Z. This follows from (3) and (4). 

(6) f(qx) = ¢f (x) for q€ Q. This follows from (5). 

(7) f(a) =4@ for q € Q because f(g) = f(g: 1) =af(1) =¢-1. 

(8) f(x?) = f(x). This follows form (4) and cr(—a, 1, x, x). 

(9) « >0=> f(x) > 0. This follows from (8) because the a real number is positive if and only if 


it is the square of a real number. 
(10) f is increasing on R. This follows from (3,4,9) because 


O<e-y => 0<f(@—y)= fw) — fly). 
Finally: An increasing function on R which fixes the rationals is the identity. (Why?) 
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Proof (of the fundamental theorem, n = 2). We will show that if f also fixes the four points 
1 0 0 1 
P= [3] Pe=[3] P=([8] m= [3], 


it must be the identity. This implies the theorem: For general f let g : RP? + RP? be the pro- 
jective transformation that maps f(P;) to P;. Then the composition go f is bijective, maps lines 
to lines and fixes the points P;. If it is the identity, then f = g~+ is a projective transformation. 


So assume that f : RP? > RP? is bijective, maps lines to lines and fixes P,, P2, P3,P1. Let 
X € RP? be any point not on the line P,P, (which we consider as the line at infinity). We will 
show that f(X) =X. 


Let €; = P3P, and 02 = P3P 3. Since f fixes these 
points, it maps ¢; to @; and é2 to &2. By the lemmas, 
the restrictions f|e, : 4; > ¢; are projective transfor- 
mations. But fle, fixes P,, Ps, and Fy = P2Psan hy, 
so it is the identity. Equally, f|c, fixes P2, P3, and 
Ey = P, PyNea, so it is the identity. Hence f fixes also 
X, = PyX Ne, and Xp = PiXN&. Since f maps 
lines to lines, X = X1P2M X2P; implies f(X) = 
f(X1)f(P2) 9 f (Xe) f (Pi) = X1 Peon XeP, = X. 


We have shown that f(X) = X for all X not on P,P). 
But then it also fixes all points on P,P). (Why?) 
Hence, f is the identity. 


Localized version of the fundamental theorem. Let U be a subset of RP” that contains 
an open ball B C R" C RP”. Suppose an injective map f : U + RP” maps lines to lines in 
the following sense: If £ is a line in RP” which intersects U, then there is a line ¢’ such that 
fENU) =LNF(U). Then f is the restriction of a projective transformation of RP” to U. 


Again, for simplicity, I will present a proof for the case n = 2 only. This already contains all the 
important ideas so you can figure out for yourself how it works for n > 2. 


Proof. (for n = 2) Define a map f : RP? — RP? as follows. For X € B let f(X) = f(X). If 
X ¢ B, let ¢1, £2 be two lines through X that intersect B and let f(X) be the intersection of the 
lines @{ and £5, the images of ¢;, 2 under f (in the sense explained in the theorem). This point 
is well defined because it does not depend on the choice of ; and £2. To see this, use Desargues’ 
theorem to show that if 21, @2,¢3 are three lines that intersect B and all go through one point 
outside B, then their images under f intersect in one point. You have to convince yourself that 
you always have enough room in the open ball to construct (the relevant part of) a Desargues 
figure. (See left figure below.) 


We have defined f using only information about f on B. In fact, f coincides with f on U. 
(Why?) Further, f maps lines to lines. To see this, use (the inverse) Desargues’ theorem to 
show that f maps three points on a line to three points on a line. Again you have to convince 
yourself that you have enough room in B to construct (the relevant part of) a Desargues figure. 
(See right figure below.) Finally, by the fundamental theorem (global version), f is a projective 
transformation. 
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Duality 
In homogeneous coordinates £1, £2,273, the equation for a line in a projective plane is 
a1 21 + a2x2 + a3%3 = 0, 


where not all coefficients a; are zero. The coefficients a ,,a2,a3 can be seen as homogeneous 
coordinates for the line, because if we replace in the equation a; by Aa; for some  # 0 we get 
an equivalent equation for the same line. Thus, the set of lines in a projective plane is itself 
a projective plane, the dual plane. Points in the dual plane correspond to lines in the original 
plane. Moreover, if we consider in the above equation the x; as fixed and the a; as variables, we 
get an equation for a line in the dual plane. Points on this line correspond to lines in the original 
plane that contain [az]. Thus, a the points on a line in the dual plane correspond to lines in the 
original plane through a point. 


It makes sense to look at this phenomenon in a basis independent way and for arbitrary dimen- 
sion. It boils down to the duality of vector spaces. 


Let V be a finite dimensional vector space over a field F’. 
The dual vector space V* of V is the vector space of linear functions V + F (linear forms on V). 


If vj,...,Un is a basis of V, the dual basis of V* is yi,...,Yn with y;(v;) = 6:3. In particu- 
lar dimV = dimV*. But there is no natural way to identify V* with V. (“Natural”’ means 
independent of any arbitrary choices. In this case: choice of a basis.) 


There is, however, a natural identification of V with V**: A vector v € V is identified with the 
linear form V* > F, y+ y(v). With this identification, V is also the dual vector space of V*. 


Let f :V > W bea linear map. The dual linear map f* : W* > V* is defined by f*(q)(v) = 
w(f(v)). Note that the dual map “goes in the opposite direction”. If f is invertible, then 
ft '=f-" isa map V* > W*. 


If U C V is a linear subspace, the annihilator of U is the linear subspace 

U® = {we V* | v(v) =0 for all ve U} CV" 
of linear forms that vanish on U. 
This provides a correspondence between subspaces of V with subspaces of V*. 
The dimensions of U and U® are related by 

dim U + dimU° = dim V. 

Indeed, let v,,...,vz be a basis for U and extend it to a basis v1,...,Un of V. Let y1,..., Yn be 
the dual basis of V*. Then (one sees easily that) Qp41,---;Qn is a basis of U®. 


(In fact, the above dimension formula is just a coordinate free way of saying that each linearly 
independent homogeneous equation in the coordinates reduces the dimension of the solution 
space by 1.) 


If U, and Us, are subspaces of V, then 
(Uy NU2)° =U,° +0," and (Uy +U2)° =U,°nU2°. 
(Can you see this?) 


Now let P(V) be the n-dimensional projective space of an (n + 1)-dimensional vector space V. 
The dual projective space is P(V“*). 


A point [v] € P(V) corresponds to the hyperplane P([v]°) € P(V*), and a point [y] € P(V*) 
corresponds to the hyperplane P({y]°) in P(V). Note that the points of the hyperplane P({y]°) 
correspond to the hyperplanes in P(V) that contain [v]. 


In general, a k-plane P(U) C P(V) corresponds to the plane P(U°) C P(V*) of dimension 
dimU° —-1=dimV —dimU —1=(n+1)—-(k+1)-l=n—-k-1. 


The points in P(U°) correspond to the hyperplanes in P(V) that contain P(U). 
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Let us take another look at duality for projective planes. (Hyperplanes in a plane are lines.) To 
aid the imagination, let us focus on the real projective plane RP? = P(R®) and its dual plane 
P(R®") which we denote by RP?* (although everything holds in general). 


So each point in RP? corresponds to a line in RP?* and vice versa. The points on a line in RP? 
correspond to the lines through the corresponding point in RP?*. Lines through a point in RP? 
correspond to the points on the corresponding line in RP?*. 


Every theorem about RP? can also be read as a theorem about RP?*. This leads to the following 
duality principle: 

From every theorem that talks only about incidence relations between points and lines in a pro- 
jective plane, one obtains another valid theorem by interchanging the words “point” and “line” 
(and the phrases “goes through” and “lies on”). 


For example, the theorem that is obtained from the Desargues theorem in this way (the dual 
Desargues theorem) turns out to be the converse of Desargues’s theorem. 


Sh <E 


We had seen that the the converse of Desargues is equivalent to Desargues, so Desargues’s 
theorem turns out to be self-dual. The same is true for Pappus’s theorem. (Check it out.) 


Note that four lines through a point in RP? correspond to four points on a line in RP?*. But for 
four points on a line we had defined the cross ratio. Via duality this gives us a definition for the 
cross ratio of four lines through a point. 


P. 
Proposition. Let 1[,,l9,l3,l4 be four lines aby 
through a point P in RP?. Let 1 be a line not P. 
containing P and let P,, P2, P3, Py be the inter- P z bs 
sections of the four lines 1; with |. Then Py 
ly 
cx(ln, lasts, la) = cr(P1, Po, Ps, Pa). i 
1 


This proposition is an immediate consequence of the following one. 


Proposition. Let P be a point in RP? and let I* be the corresponding line in RP?*, so that each 
point of [* corresponds to a line through P. Let be a line in RP? that does not contain P. Then 
the map I* — | that maps a point of l* to the intersection of the corresponding line with | is a 
projective transformation. 


Proof. Let P = [v,], and let [vs], [v3] be two points on 1. Then v1, v2, v3 is a basis of R°. Let 
£1, 2, v3 be the dual basis of R°”. The line /* is spanned by [ys], [v3]. Hence the points [y] € I* 
have representative vectors yp = sy2 +ty3, and s,t are homogeneous coordinates on /*. The line 
in RP? corresponding to [yp] intersects | in a point [v] such that v = xv2 + yv3 and 


0 = y(v) = (spo + ty3)(xv2 + yu3) = sx + ty. 


This is the case for = t,y = —s. So the map /* — I in question comes from the linear map 
Sp2 + tpg > tug — 8v3. 
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Conic sections 


The Euclidean point of view 


The conic sections are ellipses (including circles), parabolas, hyperbolas, and the degenerate 
cases of a pair of lines, which may degenerate further to one “double” line, and a single point. 


ellipse hyperbola parabola 
ed y 
—a x ry 
f T2 
x 
fi = 
2 2 2 2 
O+-.  — @-@"= a 
a b a b ce 
ry +1r2 = const. |r; — r2| = const. i = 
f=vVe—P f=) ae Rae 
two intersecting lines two parallel lines one “double” line 
# y 
b a 
a " = 
—a 
r\2 y\2 
(3) -G) =° ie 
What do these curves have in common? 
1. They arise as intersection of a plane with a cone (or cylinder in the case of two parallel lines). 
Hence the name. 
The figure on the right illustrates Dandelin’s proof for the case of an ellipse. 
2. They are all described by quadratic equations in the two Euclidean coordinates. In fact: 
Theorem. The set of solutions of any quadratic equation in two variables u, v, 
au? + 2buv + cv? +du+ev+f =0 (*) 


is empty or a conic section. 


More precisely: There is a change of coordinates (#) = A(z) +t with A € O(2), t € R? which 
reduces (*) to one of the standard forms above. Do you still know how to prove this? 
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Optical properties of the conic sections 


Here are two proofs for the case of an ellipse: 


Proof #1 (more analytic). Let y : R — R? be a parameterization 
of an ellipse, like for example y(t) = (%°°8!). The direction of 


bsint 
the tangent at a point y(t) is 7’(¢) and 
ie _ F. se — F. 
eae 92 toe 
IlyII ly - Full IlyII lly — Fall 


Now ||y — Fi|| + || y — F2|| = const. implies 


0= (lly— Fill + lly — Folly’ 
N= AD) 4. eta i) 
Illy — Fall lly’ lv — Fall 
= COS 1 — COS Q2. 


Proof #2 (more synthetic). Let P be a point of the ellipse. Ex- 
tend the line segment F)P a distance of r; beyond P. Call the 
new endpoint of the extended segment F3. Claim: The tangent 
of the ellipse at P is the perpendicular bisector ¢ of F\ F5. Indeed, 
P lies on @ because it has equal distance r; from F, and F3. Con- 
sider any other point P on @ and let r, be its distance to both F 
and Fé and let #2 be its distance to Fy. Then 7 +72 >71 +12 
so P does not lie on the ellipse. Hence, @ intersects the ellipse in 
precisely one point, P, which proves the claim. Now the equality 
of the angles follows easily. 


The advantage of the second proof is that it suggests another theorem: 


Theorem. Let c be a circle with center Fy and let F, be a point inside c. The locus of the centers 
of all circles that go through F, and touch c is an ellipse with foci F, and F. 
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The projective point of view 
Five points determine a conic section 
Pencils of conic sections 


Pascal’s theorem 


‘To be completed 


See following handwritten notes by Alina Hinzmann.” 


“Thank you, Alina! 
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Inside and outside 
The pole-polar relationship 


The dual conic and Brianchon’s theorem 


‘To be completed 


See following handwritten notes by Alina Hinzmann.” 


“Thank you, Alina! 
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The rational paramteterization of conics 


Steiner’s projective generation of conic sections 


‘To be completed 


See following handwritten notes by Alina Hinzmann.” 


“Thank you, Alina! 
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Quadrics 


Quadrics are the generalization of conic sections to arbitrary dimension: They are the sets defined 
by one quadratic equation in the coordinates. Conic sections are the special case of quadrics in 
the plane. 


The Euclidean point of view 
A quadric in the Euclidean space R” is defined by a quadratic equation 
a? Az +b? 2+c=0, with A a symmetric n x n matrix, b€ R", cER. 


This can be brought to normal form by a Euclidean motion 2 4 Max + v, with M € O(n,R), 
v € R”. In R3, the following cases can occur:! 


ellipsoid elliptic paraboloid 2-sheeted hyperboloid 


SOON 


(2) + (BP + (2? =1 eat see a el 7 Bee 


1-sheeted hyperboloid hyperbolic paraboloid 


plus some degenerate cases: 


cones and cylinders over a conic 
two planes 

one “double” plane 

one line 

one point 

the empty set. 


(2)? + (#2 = (2)? =1 2= (2)? = (#) 
The projective point of view 
If q is quadratic form on a vector space V, q 4 0, then 
Q = {[v] € P(V) | a(v) = 0} 


is called a quadric in P(V). Since Q is defined by a quadratic polynomial in the homogeneous 
coordinates, the algebraic properties of the base field of scalars of V play an important role. We 
will only consider R and comment on C. 


In RP®, there are only three non-degenerate cases depending on the signature of g: 


(0) (++++) or (-——-—), the case of definite gq leading to an empty quadric Q = —). We exclude 
this case from now on. 
(1) ( +—) or (+—-——). In an affine image of RP?, Q looks like an ellipsoid, an elliptic 


paraboloid, or a 2-sheeted hyperboloid, depending on whether the plane at infinity does not 

intersect, is tangent to, or intersects the quadric (without being tangent). 

(2) ( ). In an affine image of RP*, Q looks like a 1-sheeted hyperboloid or a hyperbolic 
paraboloid, depending on whether the plane at infinity intersects (without being tangent) or 
is tangent to Q. (In this case, any plane meets Q.) 


If q is degenerate, let Up = ker q, and let U; be a complementary subspace. Then Q is the union 
of all lines joining a point in P(U) with a point in the non-degenerate quadric in P(U;) defined 
by the restriction q|u,. 


In CP”, there is up to projective transformations only one non-degenerate quadric. There are n 
degenerate ones, depending on the rank of g (which can be 1,...,7). 


1The three images in the first row are taken from Wikipedia. 
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Proposition. If Q is a (non-empty) non-degenerate quadric in RP”, then its defining bilinear 
form is uniquely determined up to a scalar factor. 


Proof. Suppose q and g determine the same quadric. Hence q(v,v) = 0 © g(v,v) = 0. We 
want to show g = Aq. Suppose q has signature (k,n +1—k), where 1 < k < n+1, and let 
€1,---,€k, fi,--+,fn41—-% be an orthonormal basis for q (with q(e;, e;) = 1 and q(fim, fm) = —1). 
We are done if we have shown that g(e;,e;) = Adij, (fm, fi) = —Admi, and G(e:, fm) = 0. To 
see this, first note that for anyi=1,...,k andm=1,...,n+1-—k, 


q(ei + fm, ei + fm) = aes, es) = 2G(€:, fm) + Uf, fm) = 9, 


so 


0= qe fim; Ci tas) =9(e;, e:) 4 2q(e:, fm) + W(fm; fm) 
This implies g(e;, fm) = 0 and (e;,e;) = —G(fm, fm) = A for some  € R independent of i, m. 
To see that G(e;,e;) =0 fori #7 and G(fm, fi) = 0 for m Al, make a similar argument starting 
with q(v,v) =0 for v = (e; £e;) + (fm + fi) (here the signs may be chosen independently). 


Inside and outside 


Any (non-empty) non-degenerate quadric divides RP” into two connected components. If the 
signature is (n, 1), then inside and outside can be defined in the same way as for conics: A point 
is inside iff every line through it intersects the quadric in two points. 


Question: How can the two components be distinguished in general if the signature is (k,m) 
with k Am? 


If the signature is neutral (k = m), the two components can be interchanged by a projective 
transformation. 


Lines in a quadric 


A line intersects a quadric in RP” either not at all, in two points, in one point, or it lies entirely 
in the quadric. In the last two cases, the line is called a tangent. In RP*, the only non-degenerate 
quadrics that contain lines are the ones with neutral signature (+ + ——). 

Proposition. Let Q be a quadric in RP® with neutral signature. Through any point in Q there 
are precisely two lines lying entirely in Q. 

Proof. To see that there are no more than two lines through a point [p] € Q lying entirely in 
Q, show that any such line must lie in the plane q(p, -) = 0, and note that the intersection of Q 
with a plane is a conic section, so it cannot contain more that two lines. 


To see that there are actually two such lines, we may assume (after a change of coordinates, 
if necessary) that Q is the quadric 27 + 23 — 23 — 27 = 0. This equation is equivalent to 
(a1 + #3)(@1 — %3) + (%2 + %4)(X2 — %4) = 0, and, after changing to new coordinates 
Yi=%1 +73, Yo=%1— 73, Y3=—(X2tX4), Ys =T2— 4, 
to 
yiy2 — ysya = 0. 
Now the map 
1 1 a] [8 te tits 
f: RP'xRP'—+Q, (([#],[2])— H = [| 


t1 82 


Y1 
is actually a bijection RP! x RP! < Q. Indeed, if H € Q, then [ 7! ] is determined by a = 
Ya 


or by a = oe (It can happen that one of the right hand sides is 8, but not both. If neither 
is 3, they are equal.) Similarly, [22 | is determined by ra = a or by a a ra For any point 


P= f(Pi, Po) € Q, the images of the functions f(P;, -) : RP! > Q and f(-, Po): RP! > Q are 
two lines through P lying entirely in Q. 


In fact this proof also shows: 


e Q contains two families of pairwise skew lines, and each line of the first family intersects each 
line of the second family. 


e Since RP! is homeomorphic to the circle $!, Q is homeomorphic to S! x $", so it is topologically 
a torus. 
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Polarity 


A non-degenerate symmetric bilinear form g on a vector space V defines a relation between the 
points and hyperplanes of P(V): To each point [v] € P(V) corresponds the polar hyperplane 


{[w] € P(V) | a(v, w) = Of, 
and to each hyperplane there is a corresponding point, its pole. Note that 


[x] € polar hyperplane of [y] <= [y] € polar hyperplane of [zt] <= 4q(z,y) =0. 


More generally, let U C V be a (k + 1)-dimensional linear subspace of V, and let n+ 1 = dimV. 
The orthogonal subspace of U (with respect to q) is 


U+ ={w eV | q(u, w) = 0 for all u € U}. 


The dimension of U+ is dimV — dimU = n—k, and Ut+ =U. The k-plane P(U) and the 
(n —k—1)-plane P(U)+ in P(V) are called polar to each other. Polarity (with respect to q) 
is therefore a one-to-one relation between k-planes and (n — k — 1)-planes in the n-dimensional 
projective space P(V). In particular, if n = 3, polarity is a relation between points and planes 
and between lines and lines. 


Proposition. Let Q be a (non-empty) non-degenerate quadric in RP” (CP") defined by the 
symmetric bilinear form q, and let X € Q, Y € RP” (CP”). Then 
The line XY is tangent toQ <= > X is in the polar hyperplane of Y. 


Proof. Let X = [x], Y = [y]. Then g(a,x) = 0 because X € Q. The line XY is tangent to Q 
either if it intersects Q in no other point but X, or if it is contained entirely in Q. The points 
on the line XY except X are parameterized by [tz + y] with t € R (C). Such a point lies in Q if 


0 = q(ta + y, tx + y) = t q(x, x) + 2tq(x, y) + a(y,y) = 2tq(a,y) + a(y,y)- 


This equation for ¢ has one solution if q(x, y) 4 0, it has no solution if g(a, y) = 0 and q(y, y) £0, 
and it is satisfied for all t if g(a, y) = ¢(y, y) = 0. So the line XY contains no other points of Q 
except X or lies entirely in Q precisely if g(x,y) = 0. 


This provides a simple geometric interpretation of the polarity relationship between points and 
hyperplanes in the case when the polar hyperplane intersects Q: The tangents from a point to 
the quadric touch the quadric in the points in which the quadric intersects the polar hyperplane. 


If a quadric in RP? is illuminated by a point light source 
outside the quadric (or by parallel light), the borderline 
between light and shadow on the quadric is a conic in 
the polar plane; and the shadow that the quadric throws 
on some other another plane is a projected image of this 
conic. 


What about the polarity between lines in RP?? If a 
point moves on a line, the polar planes rotate about a 
line, and these two lines are polar to each other. 


Projective transformations that map a quadric to itself 


Let Q be a (non-empty) non-degenerate quadric in RP” defined by the symmetric bilinear form q 
with signature (k,n+1—k). If f : R"*! > R"*? is a linear map which is orthogonal with respect 
to q (that is, g(x,y) = ¢(f(x), f(y)) for all z,y € R”**), then the projective map [2] + [f(«)] 
clearly maps Q to Q. If the signature is not neutral (that is, if k A n+1—k), then these are all 
projective maps that map Q to Q: 


Proposition. [f the signature is not neutral, then any projective transformation that maps Q to 
Q comes from a linear map which is orthogonal with respect to q. 


Hence, under the assumption of non-neutral signature, the group of projective transformations 
mapping Q to Q is PO(k,n+1—k), the projective orthogonal group for signature (k,n+1—k). 
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Proof. Suppose [z] +> [f(a)] maps Q to Q. This means that the symmetric bilinear forms q and 
g defined by q(x, y) = a(f(x), f(y)) define the same quadric. In the last lecture we saw that this 
means ¢ = Aq for some » € R \ {0}. Hence q(f(x), f(y)) = Aq(z,y) for all 2, y € R. We will 
show that \ is positive. Then Fr f defines the same projective transformation and is orthogonal 
with respect to g. Now to see that A is positive, let e1,...,€n41 be an orthonormal basis with 
respect to gq. Then f(e1),..., f(@n41) is still an orthogonal basis. If A were negative, it would 
contain n + 1—k spacelike and k timelike vectors. This cannot be, because every orthogonal 
basis contains k spacelike and n + 1 — k timelike vectors 


To see that the non-neutral signature assumption is really necessary, consider the quadric Q 


Li x3 
defined by x? + 23 — x3 — 23 = 0. The projective transformation | 72 | 1 E maps Q to Q, 
v4 x2 
but it does not come from an orthogonal map in O(2,2). Note that it also interchanges the two 
connected components of RP?\Q. The following is true in general: If a projective transformation 
maps a non-degenerate quadric to itself and each connected component of the complement of 
Q to itself, then it comes from an orthogonal transformation. (In the case of neutral signature, 
these form only a subgroup of index 2 within the group of all projective transformations mapping 


Q to Q). 

The projective model of hyperbolic space 

We had defined n-dimensional hyperbolic space in the hyperboloid model as 

H” = {x €R®** | (x, 2) =—1 and p41 >0} where (x,y) = r1y1 +-+++2nYn — 2n41Yn41; 


equipped with the metric d defined by cosh d(a, y) = —(a, y). 


In the projective model, n-dimensional hyperbolic space consists of the set H7,,; of points inside 


a quadric Q C RP” defined by a symmetric bilinear form q of signature (n, 1), 
proj = 112] € RP” | q(x, x) < 0}, 


equipped with the metric dp,.; defined by 


N2Yy 
cosh dyre( {el fy) = Xt _ 
Vax, ©)a(y, ¥) 
This is indeed a metric space isometric to H”. First of all, if we use in RP” homogeneous 
coordinates with respect to an orthonormal basis for g, then g = (-,-), and the map 
A” —+ H” 


proj? Tt [z] 


is bijective and an isometry. 


The projective model is actually the same as the Klein model (seen from the projective point of 
view, which was how Klein saw it in the first place): The points inside the unit circle—or unit 
sphere in higher dimension—are obviously the points inside a quadric of the right signature, and 
from the projective point of view we might as well take any other such quadric. 


The group of isometries of H7,,; is the projective orthogonal group 
PO(n, 1) of projective transformations mapping Q to Q (and hence 


also the inside of Q to the inside of Q). 


Via the polarity relation, points outside Q correspond to hyperplanes 
in H;,,.;- Two hyperplanes intersect orthogonally if one contains the 
pole of the other. (Do you see why? What is the formula for the 
angle between two intersecting hyperplanes polar to two points [v] 
and [w] outside Q? What is the formula for their distance if they 


do not intersect’) 


The following proposition is due to Arthur Cayley. (Actually, its history is usually told like this: 
Cayley discovered this way of defining a metric inside a quadric in terms of a cross ratio. Then 
Klein realized that this was a model for the non-Euclidean geometry discovered by Lobachevsky.) 


i 


Proposition. For two points A,B € Hy,;, let X,Y be the points 


where the line AB intersects the quadric Q, labelled such that X, B 
separates A,Y. Then 


dproj(A, B) = 3 log er(B, X, A, ¥). 
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Proof. First convince yourself that cr(B, X,A,Y) > 1, so that the logarithm is positive (see 
Lectures 17 and 18). We will prove the proposition by showing that 


ia(a,0)| 
Va(a, a)q(, 6) 
Suppose A = [a] and B = [}], and introduce an affine parameter on the line AB by t > [a + tO]. 


The points A and B correspond to the parameter values t = 0 and t = oo. The parameter values 
for X and Y are the roots t1,2 of the quadratic equation 


cosh(4 log cr(B, X, A, Y)) = 


0 =q(a+tb,a + tb) = q(a,a) + 2q(a, b)t + q(b, b)t?. 


On the one hand, cr(B, X, A, Y) = Sots = "a so 


cosh (4 log cr(B, X, A,Y)) = Gaia +e tes) =1(,/a + ,/4). 
On the other hand, q(a, a) + 2q(a, b)t + q(b, b)t? = q(b, b)(t — t1)(t — te) implies 


2q(a, b) 
q(b, b) 


Since ¢; and tz have the same sign, and so do q(a,a) and q(b,b), we get 


q(a, a) 
q(b, b) ” 


ti t+tz=—- and tytz = 


1 to | tr) _ \t1 + ta| = |q(a, b)| 
MJa+ V8) = 2Vtit2 — \/q(a,a)q(b,b) 
Klein’s Erlangen program 


For his inaugural lecture at the university of Erlangen in 1872, Felix Klein prepared a paper 
dealing with the question “What is geometry?”. In it, he breaks radically with the traditional 
point of view that geometry is the study of the “true” space around us. Instead, geometry should 
be viewed as 
the study of invariants under a group of transformations. 

Actually this was not meant as an abstract formal definition of the field of geometry. On the 
contrary, Klein saw a “joy in the pure form or shape” as the characteristic mark of a geometer, 
and he always emphasized the importance of space intuition. The point of the Erlangen program 
was to provide an organizing principle for the overabundant material that had accumulated in 
geometry, or rather, in the different geometries that had been discovered. 


Let us see how some familiar geometries fit into this scheme. Every particular geometry deals with 
properties of figures in some space that remain invariant under some group of transformations 
of the space: 


geometry space transformation group some invariant properties & quantities 


Euclidean [R” the group of Euclidean motions distance of two points, angle of two lines 
rr Ax+v, A€ O(n), bE R” 


projective RP” the group of projective transfor- the cross ratio, being a k-plane, being a 


mations, PGL(n + 1) quadric, incidence relations 
hyperbolic Hy; PO(n,1) hyperbolic distances and angles 
spherical S$” O(n +1) spherical distances and angles 
similarity R” the group of similarity transfor- angles, ratios of distances 


mations +> A\Art+v, A € Ryo, 
A€é O(n), v € R” 


affine R” the group of affine transforma- parallelism, ratios of distances between 
tions «4 Ax+v, A€ GL(n), points on a line 
uv € R” 


Actually, one does not distinguish between two geometries if there is a bijection between the 
spaces (or at least between subsets of them) which induces an isomorphism of the transformation 
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groups. Instead, one then speaks of different models for the same geometry. For example, 
we have considered different models for hyperbolic geometry: The points inside a quadric of 
signature (n,1) are in 1-to-1 correspondence with the points of the upper sheet of a 2-sheeted 
hyperboloid, and the projective transformations that fix the quadric correspond to the orthogonal 
transformations of R”! that map each sheet to itself. We have also seen the conformal Poincaré 
models, and we will soon discuss what the corresponding transformation groups are. 


Elliptic geometry is the geometry on the sphere but pairs of opposite points are considered as 
one point. So the space is S” with opposite points identified and the group is O(n + 1)/{+J}. 
(Both I and —I fix every point of this space.) Another model for elliptic geometry is RP” with 
group PO(n + 1), the group of projective transformations that fix a definite symmetric bilinear 
form. The advantage of elliptic geometry over spherical geometry is that two lines intersect in 
one point, and there is a unique line through every two points. 


Klein’s Erlangen program emphasizes the transformation projective 
group rather than the space on which it acts. If the we | <, 
transformation group of one geometry is a subgroup of 


the transformation group of another geometry, the first bypethelic. -elipus atae 


is called a subgeometry of the second. For example, the | 
space R” of Euclidean geometry can be seen as a sub- similarity 
set of RP”—the complement of a particular hyperplane i 
Zn+1 = 0 which is considered “at infinity”. The group Euclidean 
of Euclidean motions then corresponds to the group of 


projective transformations [44] with A € O(n). Euclidean geometry is thus a subgeometry 


of projective geometry. The diagram illustrates the subgeometry relationship for some familiar 
geometries. 


The invariants of one geometry are also invariants of any of its subgeometries 
(because the group is smaller). The same is true for theorems. Every theorem of 
projective geometry is also a theorem of Euclidean geometry. The converse is not 
true, but often one can see a Euclidean theorem as special case of a projective 
theorem. For example, the theorem on the inscribed angle over a chord of a circle 
is a special case of Steiner’s theorem on the projective generation of conics. 


Klein’s group theoretical point of view on geometry illuminates the relationship between the 
different geometries. This is not only of theoretical interest, it is a great practical help in the 
day-to-day business of geometric research. When confronted with a geometric problem, it is 
usually an excellent idea to ask first: In which geometry should I treat this problem? 


Let me illustrate this with a real-live example. A couple of months 
ago, Wolfgang Schief told me about the following striking theorem 
which he had discovered. It had somehow come up in his research 
on integrable systems and he had good reason to believe it was true. 


Theorem (W. K. Schief). Consider three pairs a1, a2; b1,b2; C1, C2 
of lines in the plane. If the four intersection points a,b; lie on a 
circle and the four intersection points bj 1c; lie on a circle, then the 
four intersection points c; aj; also lie on a circle. 


How to prove this? It looks like a theorem of Euclidean geometry, or 
more precisely, of similarity geometry. But it can also be interpreted 
in terms of projective geometry and this leads to a surprisingly sim- 
ple proof. Pairs of lines and circles are all conic sections. If we consider the Euclidean plane as 
the complement in RP? of the line 73 = 0, then the circles are characterized among the (non- 
empty) non-degenerate quadrics ay1x4 + 2a42%1%2q + 24137123 + a22X3 + 2a93%2%3 + 3303 = 0 by 
the homogeneous linear equations in the coefficients a1; — a22 = 0 and ajzg = 0. (A circle with 
center (c1,¢c2) € R* and radius r has equation x} +73 — 2c)7123 — 2cov9x3 4 (cj +c —r?)x3 = 0.) 
Recall that a conic corresponds to a point in the 5-dimensional projective space of the vector 
space of quadratic forms. So a non-degenerate conic is a circle if is lies in a particular 3-plane in 
that projective space. Now the three pairs of lines correspond to points, and the three quadru- 
ples of intersection points correspond to pencils of non-degenerate conics, that is, to three lines 
connecting these points. If the 3-plane of circles intersects two of these lines, it also intersects 
the third. Thus the theorem is reduced to the fact that if a line in RP” intersects two sides of a 
triangle it also intersects the third. 
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Mobius geometry 


Elementary model 


Consider R” with the standard Euclidean scalar product (x,y) = 0} riyi- 
Reflection in a hyperplane {x : (x — a,v) = 0} is the map 


(t — 4, v) 


eroa’ =x-2 
(v, v) 


Reflection (or inversion) in a hypersphere with center c and radius r is the 


map 


2 


F i 
Se x Cc). 
? ae ae 


Note that 2’ lies on the same ray emanating from c and ||a—ce||-||2’—c|| = r?. 
Inversion in a sphere is an involution, except that the center c has no image 
and no preimage. We fix this by adding one extra point, oo, to R” and 
we declare it to be the image and preimage of c. We also declare that 
reflections in hyperplanes map co to oo. Then both kinds of reflections are involutions on 
R” U {oo}. 


A Mobius transformation of R” U {oo} is a composition of reflections in hyperplanes and hyper- 
spheres. The MGbius transformations form a group called the Mébius group denoted by Méb(n). 
A Mobius transformation is orientation reversing or preserving depending on whether it is the 
composition of an odd or even number of reflections. The subgroup of orientation preserving 
Mobius transformations is called the special Mobius group and denoted by SM6b(n) or Méb* (n). 


The Mobius group contains all similarity transformations: 


e A translation x +> x + v is the composition of two reflections in parallel hyperplanes. 

e An orthogonal transformation «++ Ax with A € O(n) is the composition of at most n reflec- 
tions in hyperplanes through the origin. (Can you prove this?) 

e A scaling transformation x +> Ax with X > 0 is the composition of a reflection in the unit 
sphere followed by a reflection in a sphere with center 0 and radius V/. (Check this.) 


Proposition. A Mébius transformation maps any hyperplane or hypersphere to a hyperplane or 
hypersphere. 


Proof. This is true for all similarity transformations. (These map hyperplanes to hyperplanes 
and hyperspheres to hyperspheres.) A reflection in a sphere with center c and radius r is the same 
as the similarity transformation 7 > +(x —c) mapping the sphere to the unit sphere, followed 
by inversion in the unit sphere, followed by the inverse similarity transformation 7H rx+c. 
(Check this.) So it remains to show that inversion in the unit sphere maps spheres and planes 
to spheres and planes. One could consider hyperspheres and hyperplanes separately but we will 
treat both cases simultaneously. Any hypersphere or hyperplane is determined by an equation 
of the form 
plia|? —2(v,2)+q=0 with ull? — pg > 0. 


If p = 0, the inequality implies v 4 0 so the equation describes a hyperplane. If p 4 0, it describes 
a hypersphere. Indeed, divide through by p to obtain 


0 = |lel|? — 2(5v, 2) + 2 = lle — Sul? — Zallull? + 2. 
This is a sphere with center Zu and radius , / = lull? = - . (The assumed inequality ensures that 


the expression under the square root is positive.) Now for x’ = Tez x one obtains 
pla’? —2v,2')+q=0 => — gla? —2(v,2)+p=0. 


So x’ is contained in a particular hyperplane or hypersphere if and only if x is contained in some 
other hyperplane or hypersphere. 
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From now on we will consider hyperplanes as a special cases of hyperspheres that contain co. So 
hypersphere will mean hypersphere or hyperplane. 


Proposition. Any bijective map f : R" U {co} > R"U {co} which maps hyperspheres to hyper- 
spheres is a Mobius transformation. 


Proof. (i) Suppose f(oo) = co. Then f maps hyperplanes to hyperplanes. Then it also maps 
lines to lines, because a line is the intersection of n—1 hyperplanes. By the fundamental theorem 
of projective geometry (or rather the corollary of it, see Lecture 19), the restriction fg» is an 
affine transformation. Since it also maps spheres to spheres it must be a similarity. 

(it) Suppose f(co) = c 4 oo. Let g be the inversion in a sphere with center c. Then go f also 
maps hyperspheres to hyperspheres and also co to oo. By (i) it is a similarity transformation, 
so f =gogof is a Mobius transformation. 


Proposition. The Mobius transformations are conformal. 


Proof. Since the similarity transformations are conformal it remains only to show that inversion 
in the unit sphere is conformal. Let t +> y(t), t+ 7(t) be two parameterized curves intersecting 
in y(to) = n(to). The intersection angle a is determined by 


(7'(to), 0'(to)) 


cos @ = 
lly’ (to) IIIl77’ (to) 
Let ¥ = aa Y7= aa 7, be the image curves after inversion in the unit sphere. One finds 
that 1 
al / / 
Ye A WY 27) 5 
mayen = Bena) 
and similarly for 7’. From this one obtains (4’,4’) = BayeViy)s so ||4’|| = Te \|>’||, and in 


the same way ||7’|| = Tarlla'I- Using 7(to) = n(to) =: p one finds that 


ol 
IIpI|* 


(7'(to),7'(to)) _ _(4" (to), (to) 


II’ (to) IN ll’(to) 11-14’ to) IN (to) 


(4 (to),  (to)) (7 (to), 0 (to)) 


and hence 


Two-dimensional Mobius geometry 


This case is special because we can identify R? with C, and R?U {co} with the extended complex 

plane G = CU {oo}, which is the same as CP!, the complex projective line (see Lecture 13). The 

orientation preserving and reversing similarity transformations are z+ az+b and z+> az+b 

(a # 0), reflection in the real line is z +> Z, and inversion in the unit circle |z| = 1 is the map 
z 1 


ze Iz? = 


z . 
Proposition. The orientation preserving and reversing Mobius transformations of C = CP! are 
precisely the maps of the form 


az+b az+b 
and zo — 
cz+d cz+d 


with det (*5) =ad—be £0. 


Ce ee 


Proof. First, these transformations form a group: The transformations of the first kind are the 
projective transformations of CP!, and the transformations of the second kind are compositions 
of these with complex conjugation z+ Z. (Note that first performing a transformation of the 
first kind and then complex conjugation also leads to a transformation of the second kind.) This 
groups contains the similarity transformations and inversion in the unit sphere, so it contains the 
Mobius group. On the other hand, it is not bigger than the Mobius group, because any of these 
transformations is a composition of reflections and similarity transformations: If c = 0, they are 
just similarity transformations. Otherwise, this follows from 


az+b _ Ss, bc — ad 
czt+d ec e(ez+d) 


and the equation obtained by replacing z by Z. 
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Two-dimensional Mébius geometry (continued) 


We have identified the extended real plane R? U {co} with the complex projective line CP! = 
C = C7 U {oo}, and we have seen that the orientation preserving Mébius transformations are 
the complex projective transformations of CP!: $Méb(2) = PGL(2,C). This has the following 
immediate consequences: 

e The orientation preserving Mobius transformations of the plane preserve the complex cross 


(21=22)(%3—24) 
(22-23) (za—21) 


then cr(f (21), f(z2), f(¢3), f(24)) = er(Z1, 22, 23, 24). 

e For any three points 21, 22, z3 and any three points w 1, w2,w3, there is a unique orientation 
preserving Mobius transformation f € PGL(2,C) with f(z;) = w;. There is also a unique 
orientation reversing one mapping z; + w,;, namely f followed by an inversion in the circle 
through wy, we, w3. 

e Four points 21, 22, 23, z4 lie on a circle if their cross ratio is real. Moreover, they are in that 
cyclic order on the circle if er(21, 22, 23, 24) < 0. 


ratio cr(21, 22, 23, 24) = If f is an orientation reversing M6bius transformation, 


When we talked about the Poincaré disk and half-plane models of the hyperbolic plane, we did 
not say what the corresponding hyperbolic isometries were. We can do so now: 


e The isometries of the hyperbolic plane in the Poincaré disk model are the Mobius transforma- 
tions that map D? + D?. These are the maps 


az+b az+b 

- — and Zr = 

bz +4 bz +4 

e The isometries of the hyperbolic plane in the half-plane model are the Mobius transformations 
that map the upper half-plane Im z > 0 to the upper half-plane. These are the maps 


with a,b€C and |al? — |b)? >0. 


ae! with ad—bc>0 and sin eee 
b cz + 


and a,b,c,d € R in both cases. 

e The Poincaré disk and half-plane models are related by the map D? > {z € C | Imz > 0}, 
Ze a. . Indeed, this maps 0 i, LH 1, iH co, —1H —-1, -—i++ 0. So it maps the circle 
through se i, -1,—7 (the unit circle) to the ele Gircliek 1,00, —1,0 (the real line); and since 
it maps 0 to 2, it maps the inside of the unit circle to the upper half-plane. 


ae 


with ad— bce <0, 


The projective model of M6bius geometry 


Stereographic projection maps R”U{oo} to the n-dimensional sphere S” C R"+! C RP™*?. (The 
point co is mapped to the north pole.) Hyperspheres in R” U {oo} are mapped to hyperspheres 
in S$”, which are intersections of S” with hyperplanes of RP"+!. The Mobius transformations 
of R” U {oo} are characterized by the property that they map hyperspheres to hyperspheres. 
Hence they correspond to those transformations of S” that map intersections with hyperplanes 
to intersections with hyperplanes. The projective transformations of RP"+! that map S$” > S$” 
have this property. Hence PO(n+1,1) C M6éb(n). In fact, these are all MGbius transformations: 
PO(n + 1,1) = Méb(n). To see this let us examine which transformations of S” correspond 
to the scalings, translations and orthogonal transformation of R” and to inversion in the unit 
sphere. 


First, S” C RP"* is 
= {[2] Ee RP**! | ¢ € R"*?, (z,z) = 0}, where (x,y) = ia LiYi — Uns2Yn42- 
So we can write (v,y) = «7 Ey, where E = Ce a) and I, is the n x n identity matrix. So 
(Ax, Ay) = 27 AT EAz and hence 
Aé€O(n+1,1)— ATEA=E => A'1=EA'E., 
Stereographic projection R” — S” is 


2u 
es 1 ( 2u ) in hom. (u u) ach 
(u, u) +1 ( ,U) —1) coords (u, u) 4 


where (-,-) is the standard Euclidean scalar product of R”. 
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Orthogonal transformations ut-+ Mu, M € O(n). Stereographic projection maps 
2Mu 2u 
Mur> cai = (448) (wih) and (438) € O(n +1,1). 
(Mu,Mu)+1 001 (u,u)+1 GO 
Scalings u++ Au. Stereographic projection maps 
2r\u 2u In 0 0 Day 
AuK> [asc | = [ress | = ( 0 3(A+%) 10-9) (wth). 
A? (u,u)+1 A(u,u)+ + 0 5 (A—t) 4(\+4) (u,u)+1 
ee 


To see that S(\) € O(n + 1,1), check that $~1(A) = s(4y and S(4) = EST(A)E. 
Translations ut++ u+v. Stereographic projection maps 


2ut+2vu In —vU u 2u 
utures  (o 2(u,v)+(v,v) | = (3 1-$(v,v) $(v,v) ) ( cat )| . 
(u,u)+2(u,v)+(v,v) +1 ot —$(v,v) 14+4(v,v) (u,u)+1 
$<, —_“<__’ 


To see that T(v) € O(n + 1,1), check that T~1(v) = T (5) and T(-v) = ETT (v)E. 
Inversion in unit sphere uty aa u. Stereographic projection maps 


2 
1 Cu,uy 2u In 2u In 
ur > | wart} = fle) |} = S10 ){ (ua , and [ 0 =e € O(n + 1,1). 
(u, u) ae 1+(u,w) 001 (uu) +1 001 


(u,u) 


Thus we have the correspondences: 

elementary model projective model 
R” U {oo} +> 8” C RP” 
Mob(n) +> PO(n + 1,1) 


nm polarity 


hypersphere C R” U {oo} > hyperplane C RP”*" intersecting S” °¢"4” point outside S” 


Hyperspheres (including hyperplanes) in R” U {oo} correspond to points outside S$”, that is to 
points [s] with (s,s) > 0. Indeed we had seen that hyperspheres in R” U {oo} are determined 


2u 
by an equation of the form p(u, u) — 2(u,v) + q = 0 with (v,v) — pq > 0. For « = (wth), 


—vU 
1 


this equation is equivalent to (x,s) = 0 with s = ( iia In particular, a hypersphere in 
—apta 
—c 
R” with center c and radius r corresponds to the point [s] with s = a( a0 Garr) ) for some 
3(-1-(c,c)+r*) 
#0. This can be used to show: 


Proposition. The hyperspheres corresponding to two points [s1], [82] outside S* intersect at an 
(81,82) = (Since the right hand side is only determined up to 


$1,81) (82,82 


sign, the angle @ is only defined up to 0 + 1 — 0.) 
Proof. First check that (s;,;) = \?r? and 


angle @ determined by cos @ = 


1 
(81, $2) = rari + te _ (cy — €2,C1 — C2)). 


Sy (81,82) _ r2+r3—(c1—c2,¢1—-¢2) 


(81,81)(82,82) arire 


Now use (ci — c2,¢1 — C2) =r? +73 — 2rir2 cosa (see Figure). 


Mobius geometric pencils of circles 


Lines in RP? correspond to 1-parameter families of 
circles in R? U {co}. The left figure shows two 
such families corresponding to a line intersecting S? 
(blue) and the polar line, which does not intersect S$? 
(green). The right figure shows two families corre- 
sponding to two polar lines tangent to $?. In both 
cases, the blue and green circles are orthogonal to 
each other. 
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Relationship between Mobius and other geometries 


Mobius geometry deals with properties of figures in S" C RP"*+ that are invariant under the 
group PO(n + 1,1) of projective transformations of RP"*! that map S$" > $”". Thus, n- 
dimensional Mobius geometry is a subgeometry of (n + 1)-dimensional projective geometry. The 
same group, PO(n+1,1), also maps B"*! (the inside of S$”) to itself. This gives the Klein model 
of (n+1)-dimensional hyperbolic geometry. So n-dimensional Mébius geometry can be seen as 
the geometry of the points in the ideal boundary of (n + 1)-dimensional hyperbolic space. 


For a point P = [p| € RP”, let Gp be the subgroup of PO(n + 1,1) consisting of all projective 
transformations that map P +> P (in addition to mapping S$” — S$"). These also map the polar 
plane of P to itself. 


If P is outside S”, then the polar plane intersects B’+!, and the geometry of this intersection 
with the group Gp is n-dimensional hyperbolic geometry. 


If P is the center of S”, then the polar plane is the plane at infinity, so Gp is the group of affine 
transformations mapping S” to itself. This is the group of orthogonal transformations. So the 
space S” with the group Gp is n-dimensional spherical geometry. If P is any other point inside 
S”, one obtains a Mobius geometrically equivalent model for n-dimensional spherical geometry. 


If P is the north pole of S”, then Gp corresponds (via stereographic projection) to the Mébius 
transformations of R” U {oo} that fix oo. These are the similarity transformations. Thus, $” 
with Gp is a model for n-dimensional similarity geometry. If P is any other point in S”, one 
obtains a Mobius geometrically equivalent model of similarity geometry. 


If P € S”, the group Gp consists of all projective transformations that come from orthogonal 
maps A € O(n + 1,1) with Ap = Xp for some \ € R \ {0}. Because p is a lightlike vector \ is 
not always equal to +1. (For example consider the orthogonal transformations that correspond 
to scalings in R” U {oo}; see last lecture.) If instead of Gp, one considers the (projectivized) 
group of all A € O(n + 1,1) with Ap = p, then one obtains a model for n-dimensional Euclidean 
geometry. 


(n + 1)-dimensional projective 
RP"t!, PGL(n + 2,R) 


peer V 


(n + 1)-dimensional hyperbolic n-dimensional Mobius 
points inside $”, PO(n + 1,1) S*® CRP*™!, PO(n+1,1) 


Y ee ata 


n-dimensional hyperbolic n-dimensional spherical n-dimensional similarity 


| 


n-dimensional Euclidean 


The paraboloid model of Mobius geometry 
Stereographic projection maps a point u € R” to the point 
[2ujer +... + 2unen + ([lull? — Densi + (lull? + Dense] 
in S” C RP”+!. Now let 
ea= $(En41 + €n42), €9 = $(—en41 + €n42). a \ Ql 


The subscripts 0 and oo are chosen because [ep] and [e..] are / cy \ 
the “south pole” and “north pole” of S”, and these are the ve - > —_ — 
images of 0 and oo under stereographic projection. Written 

in the basis B = (€1,...,€n,€o0, 0), the image of u is 


[ures cab upen + |l2l|?e0o + €0]- 


If we divide by the eo-coordinate to dehomogenize, this maps u to the point (u, ||ul|?) on the 
paraboloid uy+1 = ||ul|?. So in the new coordinates, stereographic projection becomes vertical 
projection to a paraboloid. 
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Nie 


For the new basis vectors, the Lorentz scalar product is (€9, €9) = (€o00; €o0) = 0, (€9, Cao) = — 
So the matrix of the Lorentz scalar product in the new basis B is 


The new basis B is not orhonormal. The polar plane of [e..] is the set of points [yiey +... + 
Ynen + Yoo@oo + yoeo] with yo = 0. The paraboloid model obtained by using yo to de-homogenize 
is related to the sphere model by a projective transformation that maps the tangent plane of S” 
in the north pole to the plane at infinity and, consequently, the sphere to a paraboloid. 


Because the paraboloid model is just a different projective image of the sphere model, a sphere 
in R” U {co} corresponds to the intersection of the paraboloid with a plane, which corresponds 
via polarity to a point outside the the paraboloid. Let us derive an explicit formula for the 
correspondence between spheres in R” and points outside the paraboloid. Let c € R” and r > 0 
be the center and radius of a sphere in R”. A point u € R” belongs to the sphere if it satisfies 
the equation 

||ul|? — 2(c, u) + lel]? — 1? =0. 


Let (Y1,---5Yns Yoo: Yo) = A(u1,---, Un; ||ul|?, 1) be homogeneous coordinates of the image point 
on the paraboloid. In terms of these coordinates, the sphere equation is linear: 


Qcry1 — ... — WnYn + Yoo + (|lell? — 7?) yo = 0. 


It is convenient to divide by —2 to obtain 


C1y1 +. FCnYn 5 Yoo + (llell? Tyo = 0, 


because this can be written 


( Sy cies + (Mell? — reco +0, OT Hes + Yoo@oo + yoko ) = 0. 


So in homogeneous coordinates with respect to the basis e1,...,€n, €0; Coo, 2 Sphere with center c 
and radius r corresponds to the point 


[er seen lIell? = i 1). 


Note that (after de-homogenization), the point corresponding to the circle with center c and 
radius r is at distance r? directly below the point of the paraboloid corresponding to the center c. 


In the same way, the hyperplane in R” with equation (v,u) — b = 0 corresponds to the point 


[v1,.--,Un,2b,0]. Thus, in the new coordinates, hyperspheres and hyperplanes correspond to 
points [51,...,8n, S00, 80] with s?+...+ 87 — 5.89 > 0. If so = 0, the point corresponds to 
a hyperplane in R” with normal vector (s1,...,8). If s9 4 0, it corresponds to a sphere with 


center c and radius r determined by 
1 2_ 1,2 


c= —(S1,..-,8n), re = — (8? 4.2.4 8? = sss). 
50 80 
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Lie geometry 


Suppose we distinguish between between differently oriented 
spheres. An unoriented sphere corresponds to two oriented 
spheres which consist of the same points but differ in their 


orientation. The different orientations can be visualized by (AI 
drawing arrows pointing inwards or outwards as shown in = ee 6s 
the figure. Let us define a signed radius for oriented spheres C4 
by saying that the signed radius is just the radius if the 


arrows point outward, and minus the radius if the arrows 
point inward. We define coordinates for oriented spheres by 
appending the signed radius to the Mobius geometric coordi- 
nates for spheres. Thus, in the basis of the paraboloid model, 
an oriented sphere with center c and signed radius r has homogeneous coordinates 


r>0 r<0O 


[c1, v04@ny lel]? —¢, Ler] 
These homogeneous coordinates are not independent anymore, because the first n + 2 already 
determine r?. A point [y1,---,Yn;Yos Yoos ¥n+3] Corresponds to an oriented sphere only if 
Yi +--+ +Yn — YoYo — Yn43 = 0. (1) 


This is the equation of a quadric with signature (n + 1,2) in RP”*? called the Lie quadric. We 
have thus established a correspondence between the oriented spheres in R” U {co} and the points 
in the Lie quadric. 


An oriented sphere in R” U {00} with center c and signed radius r corresponds to the point 


[v1 -++5 Uns V0, Yoos Yn+3] = [ex, seen; Ile||? i at 1,7] 
in the Lie quadric. 


While Lie geometry distinguishes between differently oriented spheres, it does not not distinguish 
between spheres and points. A point is just a sphere with radius r = 0, corresponding to a point 
in the Lie quadric with y,43 = 0. Consequently (and reassuringly) points do not come with two 
different orientations. 


A oriented hyperplane in R” with unit normal vector n and equation 
(n,u) —d=0 (2) 


can be seen as the limit of a sphere with radius r and center (d—1r)n as r + oo. For the 
corresponding points in the Lie quadric one has 


[(d—r)n, (d—r)? =r? 1,7] a [(¢ 1)n & 2d, 11] pas [ n, 2d,0, 1]. 


Oe 


Therefore, the oriented hyperplane with unit normal vector n and equation (2) correspond to 
the point 
[ — n, —2d, 0, 1] 


in the Lie quadric. 


Question: Which oriented sphere in R” does a point 


[y] = [M1 -++9Yns Yoo: Yo: Yn+3] 


in the Lie quadric (1) represent? The answer depends on whether one or both of the coordiantes 
yo and yn+3 are zero. We consider the possible cases separately: 


e yo £ 0 and yz+3 4 0: The point [y] represents an oriented sphere with center c and signed 
radius r, where 


1]. — Yn+3 
: — . 
Yo : Yo 
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© yo = 0,Yn+3 #0: The point [y] represents an oriented plane with unit normal vector n and 
equation (2), where 


Y1 
1 Yoo 
n= . > d =——_ . 
Yn+3 \ * 2 Yn+3 
Un 
© yo £0, Yn4+3 = 0: The point [y] represents the point 
Y1 
1 
—]: | ER”. 
Yo \ ~ 
Yn 
© Yo = Yn+3 = 0: In this case, equation (1) implies Y1 =... = Yn = 0, and hence [y] = [eo]. 


The point [y] = [eo] represents the point oo € R. 


A Lie transformation is a transformation of the set of points and oriented spheres in R” that 
corresponds to a projective transformation of RP”+? that maps the Lie quadric to itself. Hence 
the group of Lie transformations is PO(n + 1, 2). 


A Lie transformation does in general not map points to points. Indeed, points in R" correspond 
to the points [y] in the Lie quadric with y,43 = 0. This condition is in general not preserved by 
a projective transformation that fixes the Lie quadric. So Lie transformations are very different 
from the geometric transformation groups we have considered up to now. They act on the space 
of points and oriented spheres. They do not act on a space of points. In short (and in italics): 


Lie transformations are not point transformations. 
How can these transformations be described geometrically? Let us examine what it means for 


the corresponding oriented spheres if two points in the Lie quadric are polar to each other. 


The bilinear form corresponding to the quadratic form (1) is 


(y, Y)L =Yfit.--+Yn In - 5 Yoo Yo — 5Yo Yoo — Yn+3 Yn43- 


(Its signature is (n + 1,2).) Two circles with centers c, ¢ and radii r, 7, respectively, correspond 
to the points 
[yl =[e, lle? -r?,1, 7] and [g] = [é, lll? - 7,1, #] 


in the Lie quadric. These are polar to each other with respect to the Lie quadric if 
0= (y,9) 
= (ee) = 9 lel" =r = lel =F) ar 
= —3(lle— 4? — © -#)’). 


This is equivalent to the condition 


a 
lle- al = |r - Fi, Pr 
~« e eo ~@ 
which is fulfilled precisely if the spheres touch and the orientations of the ps 


spheres agree in the point of contact (this means that the arrows point 
in the same direction). Indeed, if the signs of the radii are different (as in 


the top figure) then the condition for oriented contact is ||c—é|| = |r|+|F|; 
if the signs are equal (as in the bottom figure) then the condition is 
lle — él] = ||r| — |#||. Thus: 


Two points in the Lie quadric are polar to each other if the corresponding 
oriented spheres are in oriented contact. 


You may want to convince yourself that this is true also for the case 
when one or both spheres are in fact hyperplanes or points. For a point and a sphere, oriented 
contact means that the point is contained in the sphere. An immediate consequence of this is: 


69 


Boris Springborn Geometry I Lectures 31-99 Winter Semester 07/08 


A Lie transformation maps spheres in oriented contact to spheres in oriented contact. 
In fact, the following is true (although I will not prove this): 


Any transformation of the space of spheres which maps spheres in oriented contact to spheres in 
oriented contact is a Lie transformation. 


So Lie geometry is the geometry in the space of oriented spheres that studies invariants under 
the group of transformations that preserve oriented contact. 
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